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"Peptides" 

Summary of the invention 

This invention relates to peptides which are fragments of 
protein products arising from frameshift mutations in 
genes, which peptides elicit T cellular immunity, and to 
cancer vaccines and compositions for anticancer treatment 
comprising said peptides. 

The invention further relates to a method for identifying 
such peptides which are fragments of protein products 
arising from frameshift mutations in genes, which may 
elicit T cellular immunity which is useful for combating 
cancer associated with said mutated genes . 

The invention also relates to DNA sequences encoding at 
least one frameshift mutant peptide, and vectors 
comprising at least one insertion site containing a DNA 
sequence encoding at least one frameshift mutant 
peptide . 

Further the invention relates to methods for the 
treatment or prophylaxis of cancers associated with 
frameshift mutations in genes by administration of at 
least one frameshift mutant peptide or a recombinant 
virus vector comprising at least one insertion site 
containing a DNA sequence encoding at least one 
frameshift mutant peptide, or an isolated DNA sequence 
comprising a DNA sequence encoding at least one 
frameshift mutant peptide. 

The present invention represents a further development of 

anticancer treatment or prophylaxis based on the use of 
peptides to generate activation and strengthening of the 
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anti cancer activity of the T cellular arm of the body's 
own immune system. 

Technical Background 
Tumour antigens, Status: 

T cell defined antigens have now been characterised in a 
broad spectrum of cancer types. These antigens can be 
divided into several main groups, depending on their 
expression. The two main groups are constituted by 
developmental differentiation related antigens 
(tumour- testis antigens, oncofoetal antigens etc., such 
as MAGE antigens and CEA) and tissue specific 
differentiation antigens (Tyrosinase, gplOO etc.). The 
group containing the truly tumour specific antigens 
contains proteins that are altered due to mutations in 
the genes encoding them. In the majority of these, the 
mutations are unique and have been detected in a single 
or in a small number of tumours. Several of these 
antigens seem to play a role in oncogenesis. 

Cancer vaccines, Status: 

The focus in cancer vaccine development has been on 
antigens expressed in a high degree within one form of 
cancer (such as melanoma) or in many kinds of cancers. One 
reason for this is the increased recruitment of patients 
into clinical protocols. The field is in rapid growth, 
illustrated by the accompanying table listing the cancer 
vaccine protocols currently registered in 
the PDQ database of NCI. 
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Inheritable cancer/cancer gene testing: 

Inherited forms of cancer occur at a certain frequency in 
5 the population. For several of these cancer forms, the 

underlying genetic defects have been mapped. This is also 
the case in Lynch syndrome cancers which constitute an 
important group of inheritable cancer. In families 
inflicted with this syndrome, family members inherit defect 
10 genes encoding DNA Mismatch Repair (MMR) Enzymes. Carriers 
of such MMR defects frequently develop colorectal cancer 
(HNPCC) and other forms of cancer (list?) . Mutations in MMR 
enzymes can be detected using gene testing in the same way 
as other cancer related genes can be detected. 

15 

Gene testing of risk groups in this case represents an 
ethical dilemma, since no acceptable forms for prophylactic 
treatment exist. At present surgery to remove the organ in 
danger to develop cancer has been the only treatment 
20 option. In' these patients, cancer vaccines will be a very 
(interesting) form of prophylaxis, provided efficient 
vaccines can be developed. 

The lack of efficient repair of mismatched DNA results in 
25 deletions and insertions in one strand of DNA, and this 
happens preferentially in stretches of DNA containing 
repeated units (repeat sequences) . Until now, focus has 
been on repeat sequences in the form of non^coding 
microsattelite loci. Indeed microsatteli te instability is 
30 the hallmark of cancers resulting from MMR defects. We 
have taken another approach, and have concentrated on 
frameshift mutations occurring in DNA sequences coding for 
proteins related to the oncogenic process. Such frameshift 
mutations result in completely new amino acid sequences 
35 in the C- terminal part of the proteins, prematurely 
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terminating where a novel stop codon appears. This results 
in two important consequences: 

1) The truncated protein resulting from the frameshift is 
generally nonfunctional, in most cases resulting in 
5 "knocking out" of an important cellular function. Aberrant 
proteins may also gain new functions such as the capacity 
to aggragate and form plaques. In both cases the frameshift 
results in disease. 

10 2) The short new C- terminal amino acid sequence resulting 
from the shift in the reading frame (the "frameshift 
sequence") is foreign to the body. It does not exist prior 
to the mutation, and it only exists in cells having the 
mutation, i.e. in tumour cells and their pre malignant 

15 progenitors. Since they are completely novel and 

therefore foreign to the immune system of the carrier, they 
may be recognised by T-cells in the repertoire of the 
carrier. So far, nobody has focused on this aspect of 
frameshift mutations, and no reports exist on the 

20 characterisation of frameshift peptides from coding regions 
of proteins as tumour antigens. This concept is therefore 
novel and forms the basis for developing vaccines based on 
these sequences. It follows that such vaccines may also be 
used prophyllactively in persons who inherit defective 

25 enzymes belonging to the MMR machinery. Such vaccines will 
therefore fill an empty space in the therapeutic 
armament against inherited forms of cancer. 

30 It has been shown that single amino acid substitutions in 
intracellular "self "-proteins may give rise to tumour 
rejection antigens, consisting of peptides differing in 
their amino acid sequence from the normal peptide. The T 
cells which recognise these peptides in the context of 

35 the major histocompatibility (MHC) molecules on the 

surface of the tumour cells, are capable of killing the 
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tumour cells and thus rejecting the tumour from the host. 

In contrast to antibodies produced by the B cells, which 
typically recognise a free antigen in its native 
5 conformation and further potentially recognise almost any 
site exposed on the antigen surface, T cells recognise an 
antigen only if the antigen is bound and' presented by a 
MHC molecule. Usually this binding will take place only 
after appropriate antigen processing, which comprises a 

10 proteolytic fragmentation of the protein, so that the 
resulting peptide fragment fits into the groove of the 
MHC molecule. Thereby T cells are enabled to also 
recognise peptides derived from intracellular proteins. T 
cells can thus recognise aberrant peptides derived from 

15 anywhere in the tumour cell, in the context of MHC 

molecules on the surface of the tumour cell, and can 
subsequently be activated to eliminate the tumour cell 
harbouring the aberrant peptide. 

20 M.Barinaga, Science, 257, 880-881, 1992 offers a short 
review of how MHC binds peptides. A more comprehensive 
explanation of the Technical Background for this 
Invention may be found in D. Male et al , Advanced 
Immunology, 1987, J . B . lippincott Company, Philadelphia. 

25 Both references are hereby included in their entirety. 

The MHC molecules in humans are normally referred to as 
HLA (human leukocyte antigen) molecules. They are., encoded 
by the HLA region on the human chromosome No 6. 

30 

The HLA molecules appear as two distinct classes 
depending on which region of the chromosome they are 
encoded by and which T cell subpopulations they interact 
with and thereby activate primarily. The class I 
35 molecules are encoded by the HLA A, B and C subloci and 
they primarily activate CD8 + cytotoxic T cells. The HLA 



BNSOOCiD: <WO 9958552A2 I > 



WO 99/58552 



• 



PCT/NO99/00143 



10 



15 



20 



25 



30 



class II molecules are encoded by the DR, DP and DQ 
subloci and primarily activate CD4+ T cells, both helper 
cells and cytotoxic cells. 

Normally every individual has six HLA Class I molecules, 
usually two from each of the three groups A, B and C. 
Correspondingly, all individuals have their own selection 
of HLA Class II molecules, again two from each of the 
three groups DP, DQ and DR. Each of the groups A, B, C 
and DP, DQ and DR are again divided into several 
subgroups. In some cases the number of different HLA 
Class I or II molecules is reduced due to the overlap of 
two HLA subgroups . 

All the gene products are highly polymorphic. Different 
individuals thus express distinct HLA molecules that 
differ from those of other individuals. This is the basis 
for the difficulties in finding HLA matched organ donors 
in transplantations. The significance of the genetic 
variation of the HLA molecules in immunobiology is 
reflected by their role as immune- response genes. Through 
their peptide binding capacity, the presence or absence 
of certain HLA molecules governs the capacity of an 
individual to respond to peptide epitopes. As a 
consequence, HLA molecules determine resistance or 
susceptibility to disease. 

T cells may control i-b<=> development and growth of cancer 
by a variety of mechanisms. Cytotoxic T cells, both HLA 
class I restricted CD8+ and HLA Class II restricted CD4+, 
may directly kill tumour cells carrying the appropriate 
tumour antigens. CD4+ helper T cells are needed for 
cytotoxic CD8+ T cell responses as well as for antibody 
responses, and for inducing macrophage and LAK cell 
killing . 
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A requirement for both HLA class I and II binding is that 
the peptides must contain a binding motif, which usually 
is different for different HLA groups and subgroups. A 
binding motif is characterised by the requirement for 
5 amino acids of a certain type, for instance the ones 

carrying large and hydrophobic or positively charged side 
groups, in definite positions of ■ the peptide so that a 
narrow fit with the pockets of the HLA binding groove is 
achieved. The result of this, taken together with the 

10 peptide length restriction of 8-10 amino acids within the 
binding groove, is that it is quite unlikely that a 
peptide binding to one type of HLA class I molecules will 
also bind to another type. Thus, for example, it may very 
well be that the peptide binding motif for the HLA-A1 and 

15 HLA-A2 subgroups, which both belong to the class I 

gender, are as different as the motifs for the HLA-Al and 
HLA-B1 molecules. 

For the same reasons it is not likely that exactly the 
20 same sequence of amino acids will be located in the 

binding groove of the different class II molecules . In 
the case of HLA class II molecules the binding sequences 
of peptides may be longer, and it has been found that 
they usually contain from 10 to 16 amino acids, some of 
25 which, at one or both terminals, are not a part of the 
binding motif for the HLA groove. 

However, an overlap of. the different peptide binding 
motifs of several HLA class I and class II molecules may 

30 occur. Peptides that have an overlap in the binding 

sequences for at least two different HLA molecules are 
said to contain "nested T cell epitopes". The various 
epitopes contained in a "nested epitope peptide" may be 
formed by processing of the peptide by antigen presenting 

35 cells and thereafter be presented to T cells bound to 
different HLA molecules. The individual variety of HLA 
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molecules in humans makes peptides containing nested 
epitopes more useful as general vaccines than peptides 
that are only capable of binding to one type of HLA 
molecule . 

5 

Effective vaccination of an individual can only be 
achieved if at least one type of HLA class 1 and/or II 
molecule in the patient can bind a vaccine peptide either 
in it's full length or as processed and trimmed by the 
10 patient's own antigen presenting cells. 

The usefulness of a peptide as a general vaccine for the 
majority of the population increases with the number of 
different HLA molecules it can bind to, either in its 
15 full length or after processing by antigen presenting 
cells . 

In order to use peptides derived from a protein encoded 
by a mutated gene as vaccines or anticancer agents to 
20 generate anti tumour CD4+ and/or CD8+ T cells, it is 

necessary to investigate the mutant protein in question 
and identify peptides that are capable, eventually after 
processing to shorter peptides by the antigene presenting 
cells, to stimulate T cells. 



Prior art 

In our International Application PCT/NO92/00032 
30 (published as W092/14756), we described synthetic 

peptides and fragments of oncogene protein products which 
have a point of mutation or translocations as compared to 
their proto-oncogene or tumour suppressor gene protein. 
These peptides correspond to, completely cover or are 
35 fragments of the processed oncogene protein fragment or 
tumour suppressor gene fragment as presented by cancer 



25 
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cells or other antigen presenting cells, and are 
presented as a HLA-peptide complex by at least one allele 
in every individual; These peptides were also shown to 
induce specific T cell responses to the actual oncogene 
protein fragment produced by the cell by processing and 
presented in the HL»A molecule. In particular, we 
described peptides derived from the p21 ras protein which 
had point mutations at particular amino acid positions, 
namely position 12, 13 and 61. These peptides have been 
shown to be effective in regulating the growth of cancer 
cells in vitro. Furthermore, the peptides were shown to 
elicit CD4 + T cell immunity against cancer cells 
harbouring the mutated p21 ras oncogene protein through 
the administration of such peptides in vaccination or 
cancer therapy schemes. Later we have shown that these 
peptides also elicit CD8+ T cell immunity against cancer 
cells harbouring the mutated p21 ras oncogene protein 
through the administration mentioned above. 

However, the peptides described above will be useful only 
in certain number of cancers, namely those which involve 
oncogenes with point mutations or translocation in a 
proto-oncogene or tumour suppressor gene. There is 
therefore a strong need for an anticancer treatment or 
vaccine which will be effective against a more general 
range of cancers. 

In general, tumors are very heterogenous with respect to 
genetic alterations found in the tumour cells. This 
implies that both the potential therapeutic effect and 
prophylactic strength of a cancer vaccine will increase 
with the number of targets that the vaccine is able to 
elicit T cell immunity against. A multiple target vaccine 
will also reduce the risk of new tumour formation by 
treatment escape variants from the primary tumour. 



WO 99/58552 



10 




PCT/NO99/00143 



Definition of Problem solved by the Invention. 

There is a continuing need for new anticancer agents based 
on antigenic peptides giving rise to specific T cellular 
5 responses and toxicity against tumours and cancer cells 
carrying genes with mutations related to cancer. The 
present invention will contribute largely to supply new 
peptides that can have a use in the combat and prevention 
of cancer as ingredients in a multiple target anti -cancer 
10 vaccine. 

Another problem solved by the present invention is that 
a protection or treatment can be offered to the 
individuals belonging to family's or groups with high 

15 risk for hereditary cancers. Hereditary cancers are in 
many cases associated with genes susceptible to 
frameshift mutations as described in this invention (i.e. 
mutations in mismatch repair genes) . Today it is possible 
to diagnose risk of getting hereditary cancer but no 

20 pharmaceutical method for protection against the onset of 
the cancer is available. 

Definition of the Invention 

25 

A main object of the invention is to obtain peptides 
corresponding to peptide fragments of mutant proteins 
produced by cancer cells which can be used to stimu 1 , : s T 
cells . 



Another main object of the invention is to develop a 
cancer therapy for cancers based on the T cell immunity 
which may be induced in patients by stimulating their T 
cells either in vivo or in vitro with the peptides 
35 according to the invention. 



30 
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A third main object of the invention is to develop a 
vaccine to prevent the establishment of or to eradicate 
cancers based solely or partly on peptides corresponding 
to peptides of the present invention which can be used to 
5 generate and activate T cells which produce cytotoxic T 

cell immunity against cells harbouring the mutated genes. 

A fourth main object of the invention is to design an 
• ^ anticancer treatment or prophylaxis specifically adapted 

10 to a human individual in need of such treatment or 

prophylaxis, which comprises administering at least one 
peptide according to this invention. 

These and other objects of the invention are achieved by 
15 the attached claims. 

Since frameshift mutations result in premature stop 
codons and therefore deletion in large parts of the 
proteins, proteins with frameshift mutations have 

20 generally not been considered to be immunogenetic and 
have therefore not been considered as targets for 
immunotherapy. Thus it has now surprisingly been found 
that a whole group of new peptides resulting from 
frameshift mutations in tumour relevant genes are useful 

25 for eliciting T cell responses against cancer cells 
harbouring genes with such frameshift mutations. 

Genes containing a mono nucleoside base repeat sequence 
of at least five residues, for example of eighth 

30 deoxy adenosine bases (AAAAAAAA) , or a di-nucleoside base 
repeat sequence of at least four di-nucleoside base 
units, for example of two deoxyadenosine-deoxycytosine 
units (ACAC) , are susceptible to frameshift mutations. 
The frameshift mutations occur, respectively, either by 

35 insertion of one or two of the mono-nucleoside base 

residue' or of one or two of the di-nucleoside base unit 
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in the repeat sequence, or by deletion of one or two of 
the mono-nucleoside base residue or of one or two of the 
di-nucleoside base unit from the repeat sequence. A gene 
with a frameshift mutation will from the point of 
mutation code for a protein with a new and totally 
different amino acid sequence as compared to the normal 
gene product. This mutant protein with the new amino acid 
sequence at the carboxy end will be specific for all 
cells carrying the modified gene. 

In the remainder of this specification and claims the 
denomination frameshift mutant peptides will comprise 
such proteins and peptide fragments thereof. 

It has now according to the present invention been found 
that such new protein sequences arising from frameshift 
mutations in genes in cancer cells give rise to tumour 
rejection antigens that are recognised by T cells in the 
context of HLA molecules. 

It has further according to the present invention been 
found a group of peptides corresponding to fragments of 
mutant proteins arising from frameshift mutations in 
genes in cancer cells which can be used to generate T 
cells. The said peptides can therefore also be used to 
rise a T cell activation against cancer cells harbouring 
a gene with a frameshift mutation as described above. 

These peptides are at least 8 amino acids long and 
correspond, either in their full length or after 
processing by antigen presenting cells, to the mutant 
gene products or fragments thereof produced by cancer 
cells in a human patient afflicted with cancer. 

A peptide according to this invention is characterised in 
that it 
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a) is at least 8 amino acids long and is a 
fragment of a mutant protein arising from a 
frameshift mutation in a gene of a cancer cell; 



and 



b) consists of at least one amino acid of the 
mutant part of a protein sequence encoded by said 
10 gene; 



and 



c) comprises 0-10 amino acids from the carboxyl 
15 terminus of the normal part of the protein 

sequence preceding the amino terminus of the 
mutant sequence and may further extend to the 
carboxyl terminus of the mutant part of the 
protein as determined by a new stop codon 
20 generated by the frameshift mutation in the gene; 



and 



d) induces, either in its full length or after 
25 processing by antigen presenting cell, T cell 

responses . 



The peptides of this invention contain preferably 8-25, 
30 9-20, 9-16, 8-12 or 20-25 amino acids. They may for 
instance contain 9, 12, 13, 16 or 21 amino acids. 



It is most preferred that the peptides of the present 
35 invention are at least 9 amino acids long, for instance 
9-18 amino acids long, but due to the processing 
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possibility of the antigen presenting cells also longer 
peptides are very suitable for the present invention. Thus 
the whole mutant amino acid sequence may be used as a 
frameshift mutant peptide according to the present 
5 invention, if it comprises 8 amino acids or more. 

The invention further relates to a method for vaccination 
of a person disposed for cancer, associated with a 
frameshift mutation in a gene, consisting of administering 
10 at least one peptide of the invention one or more times in 
an amount sufficient for induction of T-cell immunity to 
the mutant proteins encoded by the frameshift mutated gene. 

The invention also relates to a method for treatment of a 
15 patient afflicted with cancer associated with frameshift 

mutation in genes, consisting of administering at least one 
peptide of the invention one or more times in an amount 
sufficient for induction of T-cell immunity to mutant 
proteins arising from frameshift mutations in the genes of 
20 cancer cells. 

Furthermore, it has according to the present invention been 
found a method for identifying new peptides which 
correspond to fragments of proteins arising from frameshift 
25 mutations in genes. This method is characterised by the 
following steps: 

1) identifying a gene in a cancer cell susceptible 
to frameshift mutation by having a mono nucleoside 
30 base repeat sequence of at least five residues, or 

a di-nucleoside base repeat sequence of at least 
four di-nucleoside base units; 

and 

35 
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2) removing, respectively, one nucleoside base 
residue or one di -nucleoside base unit from the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 

5 sequence as far as to include a new stop codon; 

and/ or 

3) removing, respectively, two nucleoside base 

10 residues or two di-nucleoside base units from the 

repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon; 

15 and/or 

4) inserting, respectively, one nucleoside base 
residue or one di-nucleoside base unit in the 
repeat sequence and identifying the amino acid 

20 sequence of the protein encoded by the altered gene 

sequence as far as to include a new stop codon; 

and/or 

25 5) inserting, respectively, two nucleoside base 

residues or two di-nucleoside base units in the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
sequence as far as to include a new stop codon. 

30 

In order to determine whether the peptides thus 
identified are useable in the compositions and methods 
according to the present invention for the treatment or 
35 prophylaxis of cancer, the following further step should 
be performed: 
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6) determining whether the new peptide, either in 
their full length or as shorter fragments of the 
peptides, are able to stimulate T cells. 

5 

Optionally a further step may be added as follows: 

7) determining peptides containing nested epitopes 
for different major HLA class I and/or HLA class II 

10 molecules. 



Detailed Description of the invention. 

15 

In the present description and claims, the amino acids 
are represented by their one letter abbreviation as known 
in the art. 

20 The peptides of the present invention shall be explicitly 
exemplified through two different embodiments, wherein 
cancer develops based on frameshift mutations in specific 
genes, namely the BAX gene and TGF|3rii gene: 

25 I) BAX gene 

It has been established that the BAX gene is involved in 
regulation of survival or death of cells by promoting 
apoptosis. The human BAX gene contains a repeat sequence 
of eight deoxyguanosine bases (G8) in the third exon, 

30 spanning codons 3 8 to 41 (A TG GGG GGG GA G ) . 

Frameshift mutations in this G8 repeat have been 
observed, both as G7 (AT G GGG GGG AGG) and G9 (A TG GGG 
GGG GGA ) repeats, both in colon cancer cells and prostate 
35 cancer cells. The occurency is more than 50% of the 

examined cases (Rampino, N. et al . , "Somatic frameshift 
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mutations in the BAX gene in colon cancers of the 
microsatellite mutator phenotype . " , Science (Washington 
DC), 275: 967-969, 1997). The modified BAX gene products 
are unable to promote apoptosis and thus makes further 
5 tumour progress possible. Furthermore the modified gene 

products are only found in cancer cells and are therefore 
targets for specific immunotherapy. 

According to the present invention, peptides 
10 corresponding to the transformed BAX protein products 

arising from frameshift mutations in the BAX gene can be 
used as anticancer therapeutical agents or vaccines with 
the function to trigger the cellular arm of the immune 
system (T-cells) against cancer cells in patients 
15 afflicted with cancers associated with a modified BAX 
gene . 

Frameshift mutations in the BAX gene result in mutant 
peptide sequences with the first amino acid of the 
20 altered sequence in position 41 as compared to the normal 
BAX protein (Table 1, seq.id. no. 1 to 4 ) . 

Table 1 

amino acid pos 41 51 61 71 

25 normal bax peptide ; EAPELALDPV PQDASTKKLS ECLKRIGDEL 
DS . . . 

seq.id.no. 1 (bax-lG) ; RHPSWPWTRC LRMRPPRS 
seq.id.no. 4 (bax+2G) ; GRHPSWPWTR CLRMRPPRS 

seq.id.no. 2 (bax-2G) ; GTRAGPGPGA SGCVHQEAER VSQAHRGRTG Q 
30 seq.id.no. 3 (bax+lG) ; GGTRAGPGPG ASGCVHQEAE RVSQAHRGRT GQ 



35 
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Table 2 shows one group of peptides according to the 
present invention: 



Table 2 

seq.id.no. 5 

seq.id.no. 6 

seq.id.no. 7 

seq.id.no. 8 



IQDRAGRMGGRHPSWPWTRCLRMRPPRS 
I QDRAGRMGGGRHP S WPWT 

I QDRAGRMGGGGTRAG PG PGAS GC VHQE AERVS Q AHRGRTGQ 
I QDRAGRMGGGTRAGPGPG 



The peptides listed in Table 3 were used for in vitro 
generation of T cells that recognise mutant BAX peptides 



Table 3 . 
seq id no 1 
seq id no 9 
seq id no 6 
seq id no 10 
seq id no 11 
seq id no 12 
seq id no 8 



RHPSWPWTRCLRMRPPRS 

1 QDRAGRMGGRHPSWPWTRCLR 

IQDRAGRMGGGRHPSWPWT 

ASGCVHQEAERVSQAHRGRTGQ 

GGTRAGPGPGASGCVHQEAERV 

1 QDRAGRMGGGGTRAG PGPGAS 

I QDRAGRMGGGTRAGPGPG 



The most preferred peptides according to this embodiment 
of the present invention are listed in Table 4: 



Table 4 






seq 


id 


no 


1 


: RHPSWPWTRCLRMRPPRS 


seq 


id 


no 


2 


GTRAGPGPGASGCVHQEAERVSQAHRGRTGQ 


seq 


id 


no 


3 


: GGTRAGPGPGASGCVHQEAERVSQAHRGRTGQ 


seq 


id 


no 


4 


: GRHPSWPWTRCLRMRPPRS 


seq. 


id 


. no 


. 5 


: IQDRAGRMGGRHPSWPWTRCLRMRPPRS 


seq. 


id 


.no 


. 6 


IQDRAGRMGGGRHPSWPWT 


seq. 


id 


no . 


7 : 


IQDRAGRMGGGGTRAGPGPGASGCVHQEAERVSQAHRGRTGQ 


seq 


id 


no 


8: 


I QDRAGRMGGGTRAGPGPG 


seq 


id 


no 


9: 


I QDRAGRMGGRHPSWPWTRCLR 


seq 


id 


no 


10: 


ASGCVHQEAERVSQAHRGRTGQ 


seq 


id 


no 


11 : 


GGTRAGPGPGASGCVHQEAERV 
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seq id no 12 : IQDRAGRMGGGGTRAGPGPGAS 
2) TGFpRII 

It has been established that the TGFpRII gene is involved 
5 in regulation of cell growth. TGFPRII is a receptor for 
TGFP which down regulates cell growth. The human gene 
coding for TGFPRII contains a repeat sequence of ten 
deoxyadenosine bases (A10) from base no. 709 to base no. 
718 (G AA AAA AAA AAG CCT) . In colon cancers and 

10 pancreatic cancers frameshift mutations in this A10 

repeat have been observed, both as A9 ( GAA AAA AAA AG C 
CT) and All (G AA AAA AAA AAA GCC) repeats, in 
approximately 80 % of the examined cases (Yamamoto, H. , 
"Somatic frameshift mutations in DNA mismatch repair and 

15 proapoptosis genes in hereditary nonpolyposis colorectal 
cancer. " , Cancer Research 58, 997-1003, March 1, 1998). 
The modified TGFPRII gene products are unable to bind 
TGFp and the signal for down regulation of cell growth 
is eliminated and thus makes further tumour progress 

20 possible. Furthermore the modified gene products are only 
found in cancer cells and are therefore targets for 
immunotherapy . 

Consequently peptides corresponding to the transformed 
25 TGFPRII protein products arising from frameshift 

mutations in the TGFPRII gene can be used as anticancer 
therapeutical agents or vaccines with the function to 
trigger the cellular arm of the immune system (T-cells) 
against cancer cells in patients afflicted with cancers 
30 associated with a modified TGFPRII gene. 

Frameshift mutations in the TGFPRII gene result in mutant 
peptide sequences with the first amino acid of the 
altered sequence in either position 133 (one and two base 
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deletions) or 134 (one and two base insertions) as 
compared to the normal TGFPRII protein (Table 5, 
seq.id.nos. 13 and 21). 

5 Table 5. 

amino acid pos . 133 

normal TGFpRII ; K PGETFFMCSC SSDECNDNII F SEE YNTSNP 

DLLL 

seq id no 13(-1A); S LVRLSSCVPV ALMSAMTTSS SQKNITPAIL TCC 
10 seq id no 13 (+2A) ; SLVRLSSCVP VALMSAMTTS SSQKNITPAI 

LTCC 

TGFbRII + 1A) ; AW 

TGFbRII - 2 A) ; A W 

15 Table 6 shows one groups of peptides of this invention: 
Table 6 

seq id no 14 : SPKCIMKEKKSLVRLSSCVPVALMSAMTTSSSQKNITPAILTCC 
seq id no 15 : PKCIMKEKKKSLVRLSSCV 
20 seq id no 19 : SPKCIMKEKKAW 
seq id no 20 : PKCIMKEKKKAW 

Table 7 presents peptides that were used for in vitro 
generation of T cells that recognise mutant TGF0RII 
25 peptides . 



30 



35 



Table 7 






seq 


id 


no 


15 : 


PKCIMKEKKKSLVRLSSCV 


seq 


id 


no 


16 : 


ALMSAMTTSSSQKNITPAILTCC 


seq 


id 


no 


17 : 


SLVRLS SCVPVALMS AMTTS S SQ 


seq 


id 


no 


18 : 


S PKC IMKEKKS LVRL S SC VPVA 


seq 


id 


no 


19: 


SPKCIMKEKKAW 


seq 


id 


no 


20: 


PKCIMKEKKKAW 


seq 


id 


no 


21: 


AMTTSSSQKNITPAILTCC 


seq 


id 


no 


428 : 


SLVRLS SCV 
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The most preferred peptides of this embodiment of the 
present invention are : 



Table 8 






seq 


id 


no 


13 


: SLVRLSSCVPVALMSAMTTSSSQKNITPAILTCC 


seq 


id 


no 


14 


: bPKCXiyLK&KKbljvKJjbbCVP 


seq 


id 


no 


15 


: PKCIMKEKKKSLVRLSSCV 


seq 


id 


no 


16 


: ALMSAMTTSSSQKNITPAILTCC 


seq 


id 


no 


17 


: SLVRLSSCVPVALMSAMTTSSSQ 


seq 


id 


no 


18 


: SPKCIMKEKKSLVRLSSCVPVA 


seq 


id 


no 


19 


: SPKCIMKEKKAW 


seq 


id 


no 


20 


: PKC IMKEKKKAW 


seq 


id 


no 


21 


: AMTTSSSQKNITPAILTCC 


seq 


id 


no428 


: SLVRLSSCV 



Other peptides of the invention can be fragments of the 
peptides listed in the Tables 1-8 above. Such fragments 
are most preferred from 9-16 amino acids long and include 
20 at least one amino acid from the mutant part of the 
protein . 

As used in this description and claims the term fragment 
is intended to specify a shorter part of a longer peptide 
25 or of a protein. 

Other cancer associated genes containing repeat sequences 
of. a nucleoside base and which therefore are susceptible 
to frameshift mutations and consequently are potential 
30 candidates for peptides according to the present 

invention (seq id nos according to table 9 are given in 
parentheses in each case) are the following: 

Human TGF-p-2 ( hTGF(32 ) gene (seq id nos 2 2-29) 
35 Deleted in colorectal cancer (DCC) gene (seq. id. nos. 
30-34) 
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Human breast and ovarian cancer susceptibility (BRCA1) 
gene (seq.id.nos. 378-387) 

Human breast cancer susceptibility (BRCA2) gene 
(seq.id.nos. 35-94) 

Human protein tyrosine phosphatase (hPTP) gene 
(seq.id.nos. 95-102) 

Human DNA topoisomerase II (top2) gene (seq.id.nos. 
103-108) 

Human kinase (TTK) gene (seq.id.nos. 109-120) 

Human transcriptional repressor (CTCF) gene (seq.id.nos. 

121-127) 

Human FADD- homologous ICE/CED-3 -like protease gene 
(seq.id.nos. 128-133) 

Human putative mismatch repair/binding protein (hMSH3) 
gene (seq.id.nos. 134-147) 

Human retinoblastoma binding protein 1 isoform I (hRBPl) 
gene (seq.id.nos. 148-156) 

Human FMR1 (hFMRl) gene (seq.id.nos. 157-161) 
Human TINUR gene (seq.id.nos. 162-169) 
b-raf oncogene (seq.id.nos. 170-175) 

Human neurof ibromin (NF1) gene (seq.id.nos. 176-181) 

Human germline n-myc gene (seq.id.nos. 182-188) 

Human n-myc gene (seq.id.nos. 189-194) 

Human ras inhibitor gene (seq.id.nos. 195-199) 

Human hMSH6 gene (seq.id.nos. 200-203 and 293-297) 

Human nasopharynx carcinoma EBV BNLF-1 gene (seq.id.nos. 

204 210) 

Human cell cycle regulatory protein (ElA-binding protein) 
p300 gene (seq.id.nos. 211-218) 

Human B-cell lymphoma 3 -encoded protein (bcl-3) gene 
(seq.id.nos. 219-226) 

Human transforming growth factor-beta induced gene 

product (BIGH3) (seq.id.nos. 227-232) 

Human transcription factor ETV1 gene (seq.id.nos. 

233-239) 
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Human insulin-like growth factor binding protein (IGFBP4) 

gene (seq.id.nos. 240-246) 

Human MUC1 gene (seq.id.nos. 247-266) 

Human protein- tyrosine kinase (JAK1) gene (seq.id.nos. 
5 267-271) 

Human protein- tyrosine kinase (JAK3) gene (seq.id.nos. 
272-279) 

Human Flt4 gene (for transmembrane tyrosinase kinase) 
(seq. id.nos . 2 8 0-284 ) 
10 Human p53 associated gene (seq.id.nos. 285-292) 
Human can (hCAN) gene (seq.id.nos. 298-300) 
Human DBL (hDBL) proto-oncogene / Human MCF2P0 (hMCF2P0) 
gene (seq.id.nos. 301-306) 

Human dek (hDEK) gene (seq.id.nos. 3 07-3 09) 
15 Human retinoblastoma related protein (pl07) gene 
(seq.id.nos. 310-313) 

Human G protein-coupled receptor (hGPRl) gene 
(seq.id.nos. 314-319) 

Human putative RNA binding protein (hRBP56) gene 
20 (seq.id.nos. 320-325) 

Human transcription factor (hITF-2) gene (seq.id.nos. 
326-327) 

Human malignant melanoma metastasis-supressor (hKiSS-1) 
gene (seq.id.nos. 328-334) 
25 Human telomerase-associated protein TP-1 (hTP-1) gene 
(seq. id.nos . 3 3 5-348) 

Human FDF-5 (hFDF-5) gene (seq.id.nos. 349-356)- 
Human metastasis-assosiated mtal (hMTAl) gene 

(seq. id.nos . 3 57-3 62 ) 
30 Human transcription factor TFIIB 90 kDa subunit 

(hTFIIB90) gene (seq id nos 363-369) 

Human tumour suppressor (hLUCA-1) gene (seq id nos 

370-377) 

Human Wilm's tumour (WIT-1) associated protein (seq id 
35 nos 388-393) 
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Human cysteine protease (ICErel-III) gene (seq id nos 
394-398 and 459) 

Human Fas ligand (FasL) gene (seq id nos 3 99-403) 
Human BRCA1 -associated RING domain protein (BARD1) gene 
(seq id nos 404-417) 

Human mcf.2 (hMCF.2) gene (seq id nos 418-422) 
Human Fas antigen (fas) gene (seq id nos 423-427) 
Human DPC4 gene (seq id nos 429-437) . 

The mutant peptides that are the results of frameshift 
mutation in these genes, in accordance with the present 
invention, are listed in table 9. 

Table 9 

seq id no 22; TVGRPHISC 

seq id no 23; KTVGRPHISC 

seq id no 24; KQWEDPTS PANVI ALLQT 

seq id no 25; QWEDPTS PANVI ALLQT 

seq id no 26; QKTIKSTRKKTVGRPHISC 

seq id no 27; QKT I K STRKKKTVGRPH ISC 

seq id no 28; QKT I KSTRKKKQWEDPTS PANV I ALLQT 

seq id no 29; QKTIKSTRKKQWEDPTSPANVIALLQT 

seq id no 30; AADLQQQF VHFLDCWDVS S I PFTLHL PQAQDI TT 

seq id no 31; GKDAKEKSS 

seq id no 32; GKDAKEKKSS 

seq id no 33; GKDAKEKKAADLQQQFVHFLDCWDVSS I PFTLHLPQAQDITT 

seq id no 34; GKDAKEKAADLQQQFVHFLDCWDVS S I PFTLHL PQAQDITT 

seq id no 35; FSMKQTLMNVKNLKTK 

seq id no 36; KF SMKQTLMNVKNLKTK 

seq id no 37; VRTSKTRKKFSMKQTLMNVKNLKTK 

seq id no 38; VRTSKTRKKKF SMKQTLMNVKNLKTK 

seq id no 39; VRT SKTRKKNF P 

seq id no 40; VRT SKTRKNF P 

seq id no 41; I KKKLLQFQK 

seq id no 42; K I KKKLLQFQK 

seq id no 43; KSRRNYFNFKNNCQSRL 
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seq 


id 


no 


44; 


SRRNYFNFKNNCQSRL 


seq 


id 


no 


45; 


TNLRVI QK I KKKLLQFQK 


seq 


id 


no 


46; 


TNLRVI QKK I KKKLLQFQK 


seq 


id 


no 


47; 


TNLRVI QKKSRRNYFNFKNTSfCQSRL 


seq 


id 


no 


48; 


TNLRVIQKSRRNYFNFKNNCQSRL 


seq 


id 


no 


49; 


KIMIT 


seq 


id 


no 


50; 


NIDKIPEKIMIT 


seq 


id 


no 


51; 


NIDKIPEKKIMIT 


seq 


id 


no 


52; 


IINAN 


seq 


id 


no 


53; 


KIINAN 


seq 


id 


no 


54; 


NDKTVSEKI INAN 


seq 


id 


no 


55; 


NDKTVSEKKI INAN 


seq 


id 


no 


56; 


NGLEKEYLMVNQKE 


seq 


id 


no 


57; 


SQTSLLEAKNGLEKEYLMVNQKE 


seq 


id 


no 


58; 


SQTSLLEAKKNGLEKEYLMVNQKE 


seq 


id 


no 


59; 


SQTSLLEAKKMA 


seq 


id 


no 


60; 


SQTSLLEAKMA 


seq 


id 


no 


61; 


TLVFPK 


seq 


id 


no 


62; 


KTLVFPK 


seq 


id 


no 


63; 


LKNVEDQKTLVF PK 


seq 


id 


no 


64; 


LKNVEDQKKTLVFPK 


seq 


id 


no 


65; 


LKNVEDQKKH 


seq 


id 


no 


66; 


LKNVEDQKH 


seq 


id 


no 


67; 


KKIQLY 


seq 


id 


no 


.68; 


KKKIQLY 


seq 


id 


no 


69; 


RKRF S YTE YL AS 1 1 RF I F SVNRRKE I QNL S S CNFKI 


seq 


id 


no 


70; 


LRIVSYSKKKKIQLY 


seq 


id 


no 


71; 


LRIVSYSKKKKKIQLY 


seq 


id 


no 


72; 


LRIVSYSKKRKRFSYTEYLASIIRFIFSVNRRKEIQNLS- 
-SCNFKI 


seq 


id 


no 


73; 


LRIVSYSKRKRFSYTEYLASIIRFIFSVNRRKEIQNLS- 



-SCNFKI 



seq 


id 


no 


74; 


QDLPLSSICQTIVTIYWQ 


seq 


id 


no 


75; 


KQDL PL S S I CQT I VT I YWQ 


seq 


id 


no 


76; 


NRTC PFRLF VRRMLQFTGNKVLDRP 


seq 


id 


no 


77; 


GFWSWKKQDLPLSSICQTIVTIYWQ 
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seq id no 78; GFWSWKKKQDLPLSSICQTIVTIYWQ 
seq id no 79; GFWSWKKNRTCPFRLFVRRMLQFTGNKVLDRP 
seq id no 80; GFWSWKNRTCPFRLFVRRMLQFTGNKVLDRP 
seq id no 81; YRKTKNQN 
5 seq id no 82; K YRKTKNQN 
seq id no 83; NTERPKIRTN 
seq id no 84; DETFYKGKK YRKTKNQN 
seq id no 85; DETFYKGKKKYRKTKNQN 
seq id no 86; DETF YKGKKNTERPK IRTN 
10 seq id no 87; DETFYKGKNTERPKIRTN 

seq id no 88; LS INNYRFQMKF YFRFTSHGS PFTS ANF 
seq id no 89; KL S INNYRFQMKFYFRFTSHGS PFTSANF 
seq id no 90; NSVSTTTGFR 

seq id no 91; NIQLAATKKLSINNYRFQMKFYFRFTSHGSPFTSANF 
15 seq id no 92; NIQLAATKKKLSINNYRFQMKFYFRFTSHGSPFTSANF 
seq id no 93; NIQLAATKKNSVSTTTGFR 
seq id no 94; NI QLAATKNSVSTTTGFR 
seq id no 95; MEHVAPGRMSAS PQSPTQ 
seq id no 96; KMEHVAPGRMSASPQSPTQ 
20 seq id no 97; KWSTWLQAECQHLHSPQRSDKPQQAGLDQQHHCFALDS- 

- S PGPRPVFLQLLGLMGQGRHD 
seq id no 98; WSTWLQAECQHLHSPQRSDKPQQAGLDQQHHCFALDS- 

- S PGPRPVFLQLLGLMGQGRHD 
seq id no 99; TFSVWAEKMEHVAPGRMSAS PQSPTQ 
25 seq id no 100; TFSVWAEKKMEHVAPGRMS AS PQSPTQ 

seq id no 101; TF S VWAEKKWSTWLQAECQHLHS PQRSDKPQQAGLDQ - 

-QHHCFALDS SPGPRPVFL OLLGLMGQGRHD 
seq id no 102; TFSVWAEKWSTWLQAECQHLHSPQRSDKPQQAGLDQ- 

- QHHCFALDS S PGPRPVFLQLLGLMGQGRHD 
30 seq id no 103; HKWLKFC LLRLVKE S FHE 

seq id no 104; KHKWLKFCLLRLVKESFHE 

seq id no 105; KGGKAKGKKHKWLKFC LLRLVKE S FHE 

seq id no 106; KGGKAKGKKKHKWLKFCLLRLVKESFHE 

seq id no 107; KGGKAKGKKNTNG 

35 seq id no 108; KGGKAKGKNTNG 

seq id no 109; VNNFFKKL 
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seq 


id 


no 


110; 


KVNNFFKKL 




seq 


id 


no 


111; 


LSQGNVKKVNNFFKKL 




seq 


id 


no 


112; 


LSQGNVKKKVNNFFKKL 




seq 


id 


no 


113; 


GEKNDLQLFVMSDRRYKIYWTVILLNPCGNLHLKTTSL 


5 


seq 


id 


no 


114; 


KG EKNDLQLF VM S DRRYK I YWTV I LLNPCGNLHLKTT S L 




seq 


id 


no 


115; 


KGKKMICSYS 




seq 


id 


no 


116; 


GKKMICSYS 




seq 


id 


no 


117; 


SSKTFEKKGEKNDLQLFVMSDRRYKIYWTVILLNPCGN- 
-LHLKTTSL 


10 


seq 


id 


no 


118; 


SSKTFEKKKGEKNDLQLFVMSDRRYKIYWTVILLNPCGN- 
-LHLKTTSL 




seq 


id 


no 


119; 


S SKTFEKKKGKKMI C SYS 




seq 


id 


no 


12 0; 


SSKTFEKKGKKMICSYS 




seq 


id 


no 


121 ; 


QRKPKRANCVIQRRAKM 


15 


seq 


id 


no 


122; 


KQRK PKRANC V I QRRAKM 




seq 


id 


no 


123; 


NKENQKEQTALLYRGGQRCRCVCLRF 




seq 


id 


no 


123; 


NKENQKEQTALLYRGGQRCRCVCLRF 




seq 


id 


no 


124 ; 


PD YQ P PAKKQRK PKRANC V I QRRAKM 




seq 


id 


no 


125; 


PDYQPPAKKKQRKPKRANCVI QRRAKM 


20 


seq 


id 


no 


12 6; 


PDYQPPAKKNKENQKEQTALLYRGGQRCRCVCLRF 




seq 


id 


no 


127; 


PD YQ P P AKNKENQKE QT ALL YRGG QRC RC VC LRF 




seq 


id 


no 


12 8; 


NLSSLLI 




seq 


id 


no 


129; 


TCLPF 




seq 


id 


no 


130; 


QPTFTLRKNLSSLLI 


25 


seq 


id 


no 


131; 


QPTFTLRKKNLSSLLI 




seq 


id 


no 


13 2; 


QPTFTLRKKTCLPF 




seq 


id 


no 


133; 


QPTFTLRKTCLPF 




seq 


id 


no 


13 4; 


RATFLLSLWECSLPQARLCLIVSRTGLLVQS 




seq 


id 


no 


13 5; 


GQHFYWHCGSAACHRRGCV 


30 


seq 


id 


no 


13 6; 


KENVRDKKRATFLLSLWECSLPQARLCLIVSRTGLLVQS 




seq 


id 


no 


137; 


KENVRDKKKRATF LL S LWEC S L PQ ARLC L I VS RTG L L VQ S 




O O 


id 




J — JO/ 






seq 


id 


no 


139; 


KENVRDKKGQHFYWHCGSAACHRRGCV 




seq 


id 


no 


140; 


ITHTRWGITTWDSWSVRMKANWIQAQQNKSLILSPSFTK 


35 


seq 


id 


no 


141; 


K I THTRWG I TTWDS WS VRMKANWI Q AQQNK SL I L S P S FTK 




seq 


id 


no 


142; 


KLLTPGGEL PHG I LGQ 
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seq id no 143; LLTPGGELPHGILGQ 

seq id no 144; PPVCELEKITHTRWGITTWDSWSVRMKANWIQAQQNKS- 

-LILSPSFTK 

seq id no 145; PPVCELEKKI THTRWG ITTWDSWSVRMKANWI QAQQNKS - 
5 -LILSPSFTK 

seq id no 146; PPVCELEKKLLTPGGELPHGILGQ 
seq id no 147; PPVCELEKLLTPGGELPHGILGQ 
seq id no 148; SLKDELEKMKI 
seq id no 149; SLKDELEKKMKI 
10 seq id no 150; LGQ S S PEKKNKN 
seq id no 151; LGQSSPEKNKN 
seq id no 152; RLRRINGRGSQIRSRNAFNRSEE 
seq id no 153; EPKVKEEKKT 
seq id no 154; EPKVKEEKKKT 

seq id no 155; EPKVKEEKKRLRRINGRGSQIRSRNAFNRSEE 
seq id no 156; EPKVKEEKRLRRINGRGSQIRSRNAFNRSEE 
seq id no 157; TFRYKGKQHPFFST 
seq id no 158; GPNAPEEKNH 
seq id no 159; GPNAPEEKKNH 
seq id no 160; GPNAPEEKKTFRYKGKQHPFFST 
seq id no 161; GPNAPEEKTFRYKGKQHPFFST 
seq id no 162; MQNTCV 
seq id no 163; KMQNTCV 
seq id no 164; KCKIRVFSK 
25 seq id no 165; CKIRVFSK 

seq id no 166; FFKRTVQKMQNTCV 
seq id no 167; FFKRTVQKKMQNTCV 
seq id no 3 FFKRTVQKKCKIRVFSK 
seq id no 169; FFKRTVQKCKIRVFSK 
30 seq id no 170; LPHYLAH 
seq id no 171; CLITWLTN 
seq id no 172; GSTTGLSATPLPHYLAH 
seq id no 173; GSTTGLSATPPLPHYLAH 
seq id no 174; GSTTGLSATPPCLITWLTN 
35 seq id no 175; GSTTGLSATPCLITWLTN 
seq id no 17 6; RFADKPRPN 
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seq 


id 


no 


177; 


DLPTSPDQTRSGPVHVSVEP 




seq 


id 


no 


178; 


DSAAGCSGTPRFADKPRPN 




seq 


id 


no 


17 9; 


DSAAGCSGTPPRFADKPRPN 




seq 


id 


no 


180; 


DS AAGC SGT P PDL PT S PDQTRSG P VHV S VE P 


5 


seq 


id 


no 


181; 


DSAAGCSGTPDLPTSPDQTRSGPVHVSVEP 




seq 


id 


no 


182; 


AHPETPAQNRLRIPCSRREVRSRACKPPGAQGSDER- 












-RGKASPGRDCDVRTGRP 




seq 


id 


no 


183; 


PAHPETPAQNRLRIPCSRREVRSRACKPPGAQGSDER- 












- RGKAS PGRDCDVRTGRP 


10 


seq 


id 


no 


184; 


RPTRRHPRRIASGSPAVGGR 




seq. 


id 


no 


185; 


VAIRGHPRPPAHPETPAQNRLRIPCSRREVRSRACKP- 












-PGAQGSDERRGKAS PGRDCDVRTGRP 




seq 


id 


no 


186; 


VAIRGHPRPPPAHPETPAQNRLRIPCSRREVRSRACKP 












-PGAQGSDERRGKAS PGRDCDVRTGRP 


15 


seq 


id 


no 


187; 


VAIRGHPRPPRPTRRHPRRIASGSPAVGGR 




seq 


id 


no 


188; 


VAIRGHPRPRPTRRHPRRIASGSPAVGGR 




seq 


id 


no 


189; 


RGRTSGRSLSCCRRPRCRPAVASRSTAPSPRAGSR- 












-RCCLRTSCGAARPRRTRS ACGDWVAS PPTRS S - 












- SRTACGAASPPARSWSAP 


20 


seq 


id 


no 


190; 


GGGHLEEV 




seq 


id 


no 


191; 


YFGGPDSTPRGRTSGRSLSCCRRPRCRPAVASR- 












- S TAP S PRAG S RRC C LRT S C G AAR PRRTRS AC GD - 












- WVAS PPTRS S SRTACGAAS PPARSWS AP 




seq 


id 


no 


192; 


YFGGPDSTPPRGRTSGRSLSCCRRPRCRPAVASR- 


25 










-STAPSPRAGSRRCCLRTSCGAARPRRTRSACGDW- 












-VASPPTRSSSRTACGAASPPARSWSAP 




seq 


id 


no 


193 ; 


YFGGPDSTPPGGGHLEEV 




seq 


id 


no 


194; 


YFGGPDSTPGGGHLEEV 




seq 


id 


no 


195; 


HRVADP 


30 


seq 


id 


no 


196; 


LSQSSELDPPSSR 




seq 


id 


no 


197; 


LSQSSELDPPPSSR 




•3 


id 




19ft- 






seq 


id 


no 


199; 


LSQSSELDPHRVADP 




seq 


id 


no 


200; 


VILLPEDTPPS 


35 


seq 


id 


no 


201; 


VILLPEDTPPPS 




seq 


id 


no 


202; 


VILLPEDTPPLLRA 
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30 



seq 


id 


no 


203; 


VI LL peldpllra 


seq 


id 


no 


204; 


PSPLP 


seq 


id 


no 


205; 


PLLFHRPC S PS PALGATVLAVYRYE 


seq 


id 


no 


206; 


LLFHRPC S P S PALGATVLAVYRYE 


seq 


id 


no 


207; 


APRPPLGPPSPLP 


seq 


id 


no 


208; 


APRPPLGPPPSPLP 


seq 


id 


no 


209; 


APRPPLGPPPLLFHRPCSPSPALGATVLAVYRYE 


seq 


id 


no 


210; 


APRPPLGPPLLFHRPCSPSPALGATVLAVYRYE 


seq 


id 


no 


211; 


TQ VL PQGC S L S LLHTTF PHRQVPH I LDW 


seq 


id 


no 


212; 


PTQ VL PQGC S L S LLHTTF PHRQ VPH I LDW 


seq 


id 


no 


213; 


PLQSFPKDAASAFSTPRFPTDKFPTSWTGSCPGQPHGT- 



-RAFCQPGPEFNAFSAC 
seq id no 215; PS PRPQS QP PTQVL PQGC SL SLLHTTF PHRQVPH I LDW 
seq id no 216; PSPRPQSQPPPTQVLPQGCSLSLLHTTFPHRQVPHILDW 
seq id no 217; PSPRPQSQPPPLQSFPKDAASAFSTPRFPTDKFPTS- 

-WTGSCPGQPHGTRAFCQPGPEFNAFSAC 
seq id no 218; PSPRPQSQPPLQSFPKDAASAFSTPRFPTDKFPTS- 

-WTGSCPGQPHGTRAFCQPGPEFNAFSAC 
seq id no 219; T AWPGRRRF TT PE P YC LCT PLG PWAPRFLW 
seq id no 22 0; PTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 221; PRPGPAGGALLPRSLTAFVPHSGHGLPVSSGEPAYTPIP- 

-HDVPHGTPPFC 

seq id no 222; RPGPAGGALLPRSLTAFVPHSGHGLPVS SGEPAYTPI PH- 

-DVPHGTPPFC 

seq id no 223; DLPAVPGPPTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 224; :jLPAVPGPPPTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
seq id no 22 5; DLPAVPGPPPRPGPAGGALLPRSLTAFVPHSGHGLPVSSG- 

-EPAYTPIPHDVPHGTPPFC 
seq id no 22 6; DLPAVPGPPRPGPAGGALLPRSLTAFVPHSGHGLPVSSG- 

-EPAYTPIPHDVPHGTPPFC 
seq id no 227; QWGLSWMS 
seq id no 228; NGDCHGC PEGRQSL 
seq id no 229; FTMDRVLTPQWGLSWMS 



-RAFCQPGPEFNAFSAC 



seq id no 



214; 



LQSF PKDAAS AF ST PRF PTDKF PTSWTG S C PGQ PHGT - 
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seq id no 23 0; FTMDRVLTPPQWGLSWMS 
seq id no 231; FTMDRVLTPPNGDCHGCPEGRQSL 
seq id no 232; FTMDRVLT PNGDCHGC PEGRQ S L 

seq id no 233; HHPARQCPHCIMHLQTQLIHRNLTGPSQLTSLHRS- 
5 - PYQIAATPWTTDFAASFFLNPVTPFLLCRRCQGKDV- 

- LC TN ARC L S QT S P S HHKAL S RTTTQCMNT - 
-TPWLAVRPAKAFPLL 

seq id no 234; PHHPARQCPHCIMHLQTQLIHRNLTGPSQLTSLHRS- 

-PYQIAATPWTTDFAASFFLNPVTPFLLCRRCQGK- 
10 -DVLCTNARCLSQTSPSHHKALSRTTTQCMNTTP- 

-WLAVRPAKAFPLL 

seq id no 235; HTI QHAS VPTASC I SKLNS YTEN 

seq id no 23 6; PQVGMRP SN P PHH P ARQC PHC IMHLQTQL I HRNLT - 

-GPSQLTSLHRSPYQIAATPWTTDFAASFFLNPVTPFL- 
15 - LC RRC QGKD VLCTN ARC L S QT S P S HHKAL S RTTTQC - 

-MNTTPWLAVRPAKAFPLL 
seq id no 23 7; PQVGMRPSNP PPHHPARQC PHC IMHLQTQL IHRNLTGP S - 

-QLTSLHRSPYQIAATPWTTDFAASFFLNPVTPFLLCRRC- 
-QGKDVLCTNARCLSQTSPSHHKALSRTTTQCMNTTPWLA- 
20 -VRPAKAFPLL 

seq id no 23 8; PQVGMRPSNPPHTIQHASVPTASCISKLNSYTEN 
seq id no 23 9; PQVGMRPSNPHTIQHASVPTASCISKLNSYTEN 
seq id no 240; WAARSWCERRAAAVAPLAPWAWGCPAGCTPPVAARAC- 

-AATRPEGWRSPCTH 
25 seq id no 241; PWAARSWCERRAAAVAPLAPWAWGC PAGCT P PVAA- 

- RAC AATRPEGWRS PCTH 
seq id no -2 42; RGLRGAGARGGLRLLRHLRPGLGDALRGVHPPLR- 

-LGPALLPAPRGGEAPAHTdDARARRVTIGAGGDRGHPGPAAL . 
seq id no 2 43; EEKLARCRPPWAARSWCERRAAAVAPLAPWAWGCPAGC- 
30 -TPPVAARAC AATRPEGWRS PCTH 

seq id no 244; EEKLARC R P P PWAARSWCERRAAAVAPLAPWAWGC PA - 

-GCTPPVAARACAATRPEGWRSPCTH 
seq id no 245; EEKLARCRPPRGLRGAGARGGLRLLRHLRPGLGDA- 

- LRG VH P P LRLG PALL P APRGGE AP AHT D ARARRVHG AGG - 
35 -DRGHPGPAAL 
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seq id no 246; EEKLARCRPRGLRGAGARGGLRLLRHLRPGLGDALRG- 

-VHPPLRLGPALLPAPRGGEAPAHTDARARRVHGAGG- 
-DRGHPGPAAL 

seq id no 247; QPPVSPRPRRPGRPRAPPPPQPMVSPRRRTTGPPW- 

-RPPPLQSTMSPPPQALHQAQLLLWCTTAPLPGLPQPQ- 
-PARALHSQFPATTLILLPPLPAIAPRLMPVALTIARYL- 
-LSPPPITALLPSCLLGSLSFSCLFTFQTSSLIPLW- 
- K I PAPTTTKSCRETFLKW 

seq id no 248; S PGCHLGPGDQAAPGLHRPPS PWCHLGAGQQARLGVHR- 

- P S S PQCHLGLRLC I RL S F YSGAQRHLC QGYHNPS QQEHS - 
-ILNSQPPL 

seq id no 249; KPAPGSTAPQPPVSPRPRRPGRPRAPPPPQPMVSPRR- 

-RTTGPPWRPPPLQSTMSPPPQALHQAQLLLWCTTAP- 
-LPGLPQPQPARALHSQFPATTLILLPPLPAIAPRLMPVA- 
-LTIARYLLSPPPITALLPSCLLGSLSFSCLFTFQTS- 
-SLIPLWKIPAPTTTKSCRETFLKW 

seq id no 250; KPAPGSTAPPQPPVSPRPRRPGRPRAPPPPQPMVSPR- 

-RRTTGPPWRPPPLQSTMSPPPQALHQAQLLLWCT- 
-TAPLPGLPQPQPARALHSQFPATTLILLPPLPAIAP- 
-RLMPVALTIARYLLSPPPITALLPSCLLGSLSFSCLF- 
- TFQTSSLI PLWKI PAPTTTKSCRETFLKW 

seq id no 2 51; KPAPGSTAPPSPGCHLGPGDQAAPGLHRPPSPWCHL- 

- GAGQQARLGVHRP S S PQCHLGLRLC I RL S FYSGA- 
- QRHLCQGYHNPSQQEHS I LNSQPPL 

seq id no 2 52; K PAPGSTAP S PGCHLGPGDQAAPGLHRP P S PWCHL - 

-GAGQQARLGVHRPSSPQCHLGLRLCIRLSFYSGAQ- 
- RHLCQGYHNPSQQEHS I LNSQPPL 

seq id no 2 53; QPr :TV~ PRRRTTGPPWRPPPLQSTMSPP PQALHQAQL - 

-LLWCTTAPLPGLPQPQPARALHSQFPATTLILLPPLP- 
-AIAPRLMPVALTIARYLLSPPPITALLPSCLLGSL- 
- SFSCLFTFQTSSLI PLWKI PAPTTTKSCRETFLKW 

seq id no 2 54; S PWCHLGAGQQARLGVHRPS S PQCHLGLRLC IRLSF- 

-YSGAQRHLCQGYHNPSQQEHSILNSQPPL 

seq id no 2 55; RPPPGSTAPQPMVSPRRR 

seq id no 2 56; RPPPGSTAPPQPMVSPRRR 

seq id no 2 57; RPPPGSTAPPSPWCHLGA 
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seq 


id 


no 


258; 


RPPPGSTAPSPWCHLGA 




seq 


id 


no 


259; 


RPRAPPPPSPWCHL 




seq 


id 


no 


260; 


RPRAPPPPPSPWC 




seq 


id 


no 


261; 


RPRAPPPPAHGVTSAP 


5 


seq 


id 


no 


262; 


RPRAPPPPPAHGV 




seq 


id 


no 


263; 


APGLHRPPQPMVSP 




seq 


id 


no 


264; 


AAPGLHRPQPMVS PR 




seq 


id 


no 


265; 


PGLHRPPPAHGVT 




seq 


id 


no 


266; 


APGLHRP P AHG VT S 


10 


seq 


id 


no 


267; 


VDRPQHTEWL SWSNL YRI RHQ 




seq 


id 


no 


268; 


HYLCTDVAPR 




seq 


id 


no 


269; 


HYLCTDVAPPR 




seq 


id 


no 


270; 


HYLCTDVAPPVDRPQHTEWLSWSNLYRIRHQ 




seq 


id 


no 


271; 


HYLCTDVAPVDRPQHTEWLSWSNLYRIRHQ 


15 


seq 


id 


no 


272; 


SAYLSPLGTTWLRTCACRLPRPAASCLCTTPSLLW- 












- PRRTCPAGS PRATS SPWRMPAPKSCCTTGLAFTS - 












-PIGLGWRSATASGYARIWPVLSLTCQSWSTSLPSTAVTW 




seq 


id 


no 


273; 


PSAYLSPLGTTWLRTCACRLPRPAASCLCTTPSLLWP- 












- RRTC PAG S PRAT S S PWRMPAPKSCCTTGLAFTS P - 


20 










-IGLGWRSATASGYARIWPVLSLTCQSWSTSLPSTAVTW 




seq 


id 


no 


274; 


PAPIFLLWGPLG 




seq 


id 


no 


275; 


APIFLLWGPLG 




seq 


id 


no 


276; 


LPARAPGPPSAYLSPLGTTWLRTCACRLPRPAASCL- 












-CTTPSLLWPRRTCPAGSPRATSSPWRMPAPKSCC- 


25 










-TTGLAFTSPIGLGWRSATASGYARIWPVLSLT- 












-CQSWSTSLPSTAVTW 




seq 


id 


no 


277; 


LPARAPGPPPSAYLSPLGTTWLRTCACRLPRPAAS- 












-CLCTTPSLLWPRRT©PAGSPRATSSPWRMPAPKSCC- 












-TTGLAFTSPIGLGWRSATASGYARIWPVLSLTC- 


30 










-QSWSTSLPSTAVTW 




seq 


id 


no 


278; 


L P ARAPG P P PA P I F LLWG PLG 

j 




seq 


id 


no 


27 9 ; 


LPARAPGPPAPIFL.LWGPL.G 




seq 


id 


no 


280; 


DLEHHGGVTRHRHR 




seq 


id 


no 


281; 


LVSDYSMTPRP 


35 


seq 


id 


no 


282; 


LVSDYSMTPPRP 




seq 


id 


no 


283 ; 


LVSDYSMTPPDLEHHGGVTRHRHR 
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seq 


id 


no 


284; 


LVSDYSMTPDLEHHGGVTRHRHR 




seq 


id 


no 


285; 


FHHIATDVGPFVRIGFLKIKGKIKGKSLRKPNW- 












-KTQHKLKRALMFLIVKKL 


5 


seq 


id 


no 


286; 


PFHHIATDVGPFVRIGFLKIKGKIKGKSLRKPNWK- 












- TQHKLKRALMFL I VKKL 




seq 


id 


no 


287; 


PSITLQQMLAPS 




seq 


id 


no 


298; 


SITLQQMLAPS 




seq 


id 


no 


289; 


TSCNEMNPPFHHIATDVGPFVRIGFIiKIKGK-IKGKSL 


10 










- RKPNWKTQHKLKRALMFL I VKKL 




seq 


id 


no 


290; 


TSCNEMNPPPFHHIATDVGPFVRIGFLKIKGKIKG- 












-KSLRKPNWKTQHKLKRALMFLIVKKL 




seq 


id 


no 


291; 


TSCNEMNPPS ITLQQMLAPS 




seq 


id 


no 


292; 


TSCNEMNPPPS ITLQQMLAPS 


15 


seq 


id 


no 


293; 


LEMILFLMTF 




seq 


id 


no 


294; 


HPCITKTFLEMILFLMTF 




seq 


id 


no 


295; 


HPCITKTFFLEMILFLMTF 




seq 


id 


no 


296; 


HPCITKTFFWR 




seq 


id 


no 


297; 


HPCITKTFWR 


20 


seq 


id 


no 


298; 


LMFEHSQMRLNSKNAHLPI I SF 




seq 


id 


no 


299; 


EYGSI I AFLMFEHSQMRLNSKNAHLPI I SF 




seq 


id 


no 


300; 


EYGS 1 1 AFFLMFEHSQMRLNSKNAHLPI I SF 




seq 


id 


no 


301; 


HLNKGRRLGDK I RAT 




seq 


id 


no 


302; 


FHLNKGRRLGDKIRAT 


25 


seq 


id 


no 


303; 


VTSGTPFFHLNKGRRLGDKIRAT 




seq 


id 


no 


304; 


VTSGTPFFFHLNKGRRLGDKIRAT 




seq 


id 


no 


305; 


VTSGTPFFF I 




seq 


id 


no 


306; 


VTSGTPFFI 




seq 


id 


no 


307; 


CEIERIHFFF 


30 


seq 


id 


no 


308; 


CEIERIHFFSK 




seq 


id 


no 


309; 


CEIERIHFSK 




seq 


id 


no 


310; 


FRYISKSI 




seq 


id 


no 


311; 


RYISKSI 




seq 


id 


no 


312; 


FKKYEPIFFRYISKSI 


35 


seq 


id 


no 


313 ; 


FKK YE P I FRY I S KS I 
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15 



20 



25 



30 
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35 



seq 


id 


no 


314; 


FPDSDQPGPLYPLDPSCLISSASNPQELSDCHYIH- 










- LiAFGFSNWRSCPVL PGHCGVQ 


seq 


id 


no 


315; 


PDSDQPGPLYPLDPSCLISSASNPQELSDCHYIHL- 










- AFGF SNWR S C P VL PGHCGVQ 


seq 


id 


no 


316; 


LNMFASVFS 


seq 


id 


no 


317; 


LNMFASVFFS 


seq 


id 


no 


318; 


LNMFASVFFPDSDQPGPLYPLDPSCLISSASNPQE- 










-LSDCHYIHLAFGFSNWRSCPVL PGHCGVQ 


seq 


id 


no 


319; 


LNMFASVFPDSDQPGPLYPLDPSCLI S SASNPQELS - 










- DCH Y I HL AFGF SNWR S C P VL PGHC G VQ 


seq 


id 


no 


320; 


AMEET WVAVATVETEVEAMEETGWAAMEETEVG AT - 










-EETEVAMEAKWEEETTTEMI SATDHT 


seq 


id 


no 


321; 


LWRPWLWEWLRWRPKWRLWRRQEWWRLWRRPRWGL - 










- RRRPRWLWRENGRKKRLQK 


seq 


id 


no 


322; 


YGGDRSRGAMEETVWAVATVETEVEAMEETGWAAM - 










- EETEVGATEETEVAMEAKWEEETTTEMI SATDHT 


seq 


id 


no 


323; 


YGGDRSRGGAMEETVWAVATVETEVEAMEETGWA- 










- AMEETEVGATEETEVAMEAKWEEETTTEMI SATDHT 


seq 


id 


no 


324; 


YGGDRSRGGLWVRPWLWEWLRWEPKWRLWRRQEWW- 










- RLWRRPRWGLRRRPRWLWRENGRKKRLQK 


seq 


id 


no 


325; 


YGGDRSRGLWRPWLWEWLRWEPKWRLWRRQEWWR- 










- LWRRPRWGLRRRPRWLWRENGRKKRLQK 


seq 


id 


no 


326; 


EFGGGRRQK 


seq 


id 


no 


327; 


EFGGRRQK 


seq 


id 


no 


328; 


RRAKGGGAGASNPRQ 


seq 


id 


no 


329; 


GRRAKGGGAGASNPRQ 


seq 


id 


no 


330; 


DVGLtREG AL EL PTRGNKRNVA 


seq 


id 


no 


331; 


MRGGGGVGGRRAKGGGAGASNPRQ 


seq 


id 


no 


332; 


MRGGGGVGGGRRAKGGGAGASNPRQ 


seq 


id 


no 


333; 


MRGGGGVGGDVGLREGALEL PTRGNKRNVA 


seq 


id 


no 


334; 


MRGGGGVGDVGLREGALELPTRGNKRNVA 


seq 


id 


no 


335; 


VWQLAG PML AGWRS LG SWFCRMYG I 


seq 


id 


no 


336; 


CGSWPALCWRAGGVWAVGSAGCMEYDPEALPAAWGP- 










-AAAATVHPRR 


seq 


id 


no 


337; 


RRY PC EWG VWQLAG PML AGWR S LG S WFC RMYG I 
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5 



25 



seq 


id 


no 


338 ; 


seq 


id 


no 


339; 


seq 


id 


no 


340; 


seq 


id 


no 


341; 


seq 


id 


no 


342; 


seq 


id 


no 


343; 


seq 


id 


no 


344; 


seq 


id 


no 


345; 


seq 


id 


no 


346; 


seq 


id 


no 


347; 


seq 


id 


no 


348; 


seq 


id 


no 


349; 


seq 


id 


no 


350; 


seq 


id 


no 


351; 


seq 


id 


no 


352; 


seq 


id 


no 


353; 


seq 


id 


no 


354; 


seq 


id 


no 


355; 


seq 


id 


no 


356; 


seq 


id 


no 


357; 


seq 


id 


no 


358; 



BNSDOCtD: <WO 9958552A2 I. > 




PCT/NO99/00143 



RR Y PC EWGG VWQ LAG PML AGWR S LG SWF C RM YG I 
RRYPCEWGGCGSWPALCWRAGGVWAVGSAGCMEYD- 
- E AL P AAWG P AAAAT VH PRR 

RRYPCEWGCGSWPALCWRAGGVWAVGSAGCMEYDPE- 

- AL P AAWG P AAAATVH PRR 

LWLWAGWTVWWS CGPGEKGHGW P S L PTMALLL LRF S CM - 
-RVASY 

GLWLWAGWTVWWSCGPGEKGHGWPSLPTMALLLL- 
-RFSCMRVASY 

GCGCGPAGQYGGAVGLARRGTAGCLPCPPWLCCCCAF- 

- P ACGL PGTDGWRGWQG S GC VRVS G S APWA PGF PF S P - 
-PCPLCGTQPRW 

C GCG P AGQ YGG AVGL ARRGTAG C L PC P PWLC C C C AF P ACG - 

- LPGTDGWRGWQGSGCVRVSGSAPWAPGFPFS PPC - 
-PLCGTQPRW 

LAFNVPGGLWLWAGWTVWWSCGPGEKGHGWPSLPTMA- 

- LLLLRF SCMRVAS Y 

L AFNVPGGGLWLWAGWTVWWS C G PGEKGHGWP S L PTM - 
- ALLL LRF S CMRVAS Y 

LAFWPGGGCGCGPAGQYGGAVGLARRGTAGCLPCPP- 
-WLCCCCAFPACGLPGTDGWRGWQGSGCVRVSGSAPW- 
-APGFPFSPPCPLCGTQPRW 

LAFNVPGGCGCGPAGQYGGAVGLARRGTAGCLPCPPW- 

-LCCCCAFPACGLPGTDGWRGWQGSGCVRVSGSAPWA- 

- PGF PF S P PC PLCGTQ PRW 

PPMPMPGQREAPGRQEA 

G P PMPMPGQRE APGRQEA 

GHQCQCQGKuxvHRADRRPDTAQEE 

HQC QC QGKGRHRADRRPDT AQEE 

GGHSYGGGPPMPMPGQREAPGRQEA 

GGHSYGGGGPPMPMPGQREAPGRQEA 

GGHSYGGGGHQCQCQGKGRHRADRRPDTAQEE 

GGHSYGGGHQCQCQGKGRHRADRRPDTAQEE 

APCPQSSGGG 

LPAPSQAAADELDRRPG 
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37 





seq 


id 


no 


359; 




seq 


id 


no 


360; 




seq 


id 


no 


361; 




seq 


id 


no 


362; 


5 


sea 


id 


no 


363 ; 




seq 


id 


no 


364; 




seq 


id 


no 


365; 


10 


seq 


id 


no 


366; 




seq 


id 


no 


367; 




seq 


id 


no 


368; 


15 


seq 


id 


no 


369 ; 




seq 


id 


no 


370; 




seq 


id 


no 


371 • 




seq 


id 


no 


372 ; 




seq 


id 


no 


373 ; 


20 


seq 


id 


no 


374; 




seq 


id 


no 


37 5; 




seq 


id 


no 


37 6; 




seq 


id 


no 


377; 




seq 


id 


no 


37 8; 


25 


seq 


id 


no 


37 9; 




seq 


id 


no 


380; 




seq 


id 


no 


3 81; 




seq 




no 


TOO. 

Jo/; 


30 


seq 


id 


no 


3 84; 




seq 


id 


no 


38.5; 



-DLYRELLGG 



-DLYRELLGG 



-AAASHLNKDLYRELLGG 
ENSRPKRGGGCSLAKDGSTEDr 
- ELEAAASHLNKDL YRELLGG 



RRTPSTEKR 
RRTPSTEKKR 



-DMTAILSQS 

SSEEIKKKSTTKCQSGTAE1 
-TSQMNRQVKDMTAILSQS 



- VT S QMNRQ VKDMT A I L S Q S 
seq id no 3 86; S S EE I KKKK VQ PNAS Q AQ QK PTTHGR 
35 seq id no 3 87; S S EE I KKKVQ PNA S Q AQQK PTTHGR 
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seq id no 3 88; NRGWVGAGE 

seq id no 3 89; IEAG 

seq id no 3 90; VHNYCNMKNRGWVGAGE 

seq id no 3 91; VHNYCNMKKNRGWVGAGE 

5 seq id no 392; VHNYCNMKK I E AG 

seq id no 393; VHNYCNMKIEAG 

seq id no 394; QLRCWNTWAKMFFMVFLI IWQNTMF 

seq id no 3 95; VKKDNHKKQLRCWNTWAKMFFMVFLI IWQNTMF 

seq id no 396; VKKDNHKKKQLRCWNTWAKMFFMVFLI IWQNTMF 

10 seq id no 397; VKKDNHKKKNS 

seq id no 3 98; VKKDNHKKNS 

seq id no 3 99; GAEESGPFNRQVQLKVHASGMGRHLWNCPAFWSEV 

seq id no 400; HPSPPPEKRS 

seq id no 401; HPSPPPEKKRS 

15 seq id no 402; HPSPPPEKKGAEESGPFNRQVQLKVHASGMGRHLW- 



seq id no 403; HPSPPPEKGAEESGPFNRQVQLKVHASGMGRHLWN- 



seq id no 4 04; MQVLSKTHMNLFPQVLLQMFLRGLKRLLQDLEKSKKRKL 
20 seq id no 405; RCKSARLI 

seq id no 406; VQTQPAIKKMQVLSKTHMNLFPQVLLQMFLRGLKRLLQ- 



seq id no 407; VQTQPAIKKKMQVLSKTHMNLFPQVLLQMFLRGLKRL- 



25 seq id no 408; VQTQPAIKKRCKSARLI 

seq id no 409; VQTQPAIKRCKSARLI 

seq id no 410; ARSGKKQKRKL 

seq id no 411; ARSGKKQKKRK^ 

seq id no 412; ARS GKKQKKENF S 

30 seq id no 413; ARSGKKQKENFS 

seq id no 414; KASARSGKSKKRKL 

seq id no 415; KASARSGKKSKKRKL 

seq id no 416; KASARSGKKAKKENSF 

seq id no 417; KASARSGKAKKENSF 

35 seq id no 418; HLNKGRRLGDKIRAT 

seq id no 419; VTSGTPFFHLNKGRRLGDKIRAT 



-NCPAFWSEV 



-CPAFWSEV 



-DLEKSKKRKL 



-LQDLEKSKKRKL 
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seq 


id 


no 


420; 


VTSGTPFFFHLNKGRRLGDKIRAT 




seq 


id 


no 


421; 


VTSGTPFFFI 




seq 


id 


no 


422; 


VTSGTPFFI 


5 


seq 


id 


no 


423; 


VTLL YVNTVTLt A PNVNME S S RNAH S PATP S AKRK - 
- DPDLTWGGFVFFFCQFH 




seq 


id 


no 


424; 


KCRCKPNFFVTLLYVNTVTLAPNVNMES SRNAHS P - 
-ATPSAKRKDPDLTWGGFVFFFCQFH 




seq 


id 


no 


425; 


KCRCKPNFFFVTLLYVNTVTLAP3SIVNMESSRNAH- 
-SPATPSAKRKDPDLTWGGFVFFFCQFH 


10 


seq 


id 


no 


426; 


KCRCKPNFFL 




seq 


id 


no 


427; 


KCRCKPNFL 




seq 


id 


no 


429; 


LVKKLKEKKMNWI L 




seq 


id 


no 


430; 


LVKKLKEKKKMNWI L 




seq 


id 


no 


431; 


LVKKLKEKKR 


15 


seq 


id 


no 


432; 


LVKKLKEKR 




seq 


id 


no 


433; 


AAIVKDCCR 




seq 


id 


no 


434; 


S Q PAS I LGRKL 




seq 


id 


no 


435; 


S Q PAS I LGKRKL 




seq 


id 


no 


436; 


SQPASILGKAAIVKDCCR 


20 


seq 


id 


no 


437; 


SQPASILGAAIVKDCCR 




seq 


id 


no 


459; 


NTWAKMFFMVFLI IWQNTMF 



Examples of cancers particularly suitable for treatment 
25 with one or a combination of several of this compounds 
are: colorectal cancer, breast cancer, small-cell lung 
cancer, non small-cell lung cancer, liver cancer (primary 
and secondary) , renal cancer, melanoma, ovarian cancer, 
cancer of the brain, head and neck cancer, pancreatic 
30 cancer, gastric cancer, eosophageal cancer, prostate 
cancer and leukemias and lymphomas . 

Below are listed some examples of where these mutations 
may result in gene products that result in development of 
35 tumours : 
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Development of colorectal cancers are believed to result 
from a series of genetic alterations. Deleted in 
colorectal cancer (DCC) gene (seq id nos 30-34), Human 
cysteine protease (ICErel-III) gene (seq id nos 394-398 
5 and 459), Human putative mismatch repair /binding protein 
(hMSH3) gene (Seq id hos 134-147), Human hMSH6 gene (seq 
id nos 201-204 and 295-299), Human n-myc gene (seq id nos 
190-195), Human TGF02 (hTGFp2) gene (seq id nos 22-29), 
Human p53 associated gene (seq id nos 287-294) may be 
10 involved in colorectal cancer. 

Human breast cancer susceptibility (BRCA2) (seq id nos 
3 5-94) and Human BRCAl-associated RING domain protein 
(BARD1) gene (seq id nos 404-413) are involved in breast 
15 cancer and ovarian cancer Human hMSH6 gene (seq id nos 
2 01-2 04 and 295-2 99) may be involved in brain tumours. 

Gene alteration are frequent in many types of 
adenocarcinomas , below are listed some genes that are 
20 mutated in many cancers : 

Human breast cancer susceptibility (BRCA2) gene (seq id 
nos 35-94) , Deleted in colorectal cancer (DCC) gene (seq id 
nos 30-34), Human putatative mismatch repair/binding 

25 protein (hMSH3) gene (seq id nos 134-147), Human hMSH6 
gene (seq id nos 201-204 and 295-299), human N-MYC gene 
(seq id no 190-195), Human TGFb2 (hTGFb2) gene (seq id nos 
22-29), Human p53 associated c^n » (seq id nos 287-294), 
Human MUC1 gene (seq id nos 248-267), Human germline n-myc 

30 gene (seq id nos 184-195), Human Wilm's tumour (WIT-1) 

associated protein (seq id nos 388-393), Human nasopharynx 
carcinoma EBV BNLF-1 gene (seq id nos 205-211), Human 
transforming growth factor-beta inducted gene product 
(BIGH3) seq id nos 228-233). 

35 
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Many of the mutated genes may result in development of 
leukemias and lymphomas: Human neurof ibromin (NF1) gene 
(seq id nos 178-183), b-raf oncogene (seq id nos 172-177), 
Human protein- tyrosine kinase (JAK1) gene (seq id nos 
5 268-272), Human protein- tyrosine kinase (JAK3) gene (seq 
id nos 273-280) are examples. 

Genes involved in malignant melanoma: Human malignant 
melanoma metastasis-supressor (hKiSS-1) gene (seq id nos 
10 331-337), Genes involved in metastasis: Human 

metastasis-assosiated mtal (hMTAl) gene (seq id nos 
360-365) . 

Cell cycle control and signal transduction is strikcly 

15 regulated. Frameshift mutations in these genes may result 
in uncontrolled cell growth. Examples of genes which may 
be suseptable are: Human protein tyrosine phosphatase 
(hPTP) gene (seq id nos 95-102), Human kinase (TTK) gene 
(seq id nos 109-121) , Human transcriptional repressor 

20 (CTCF) gene (seq id nos 122-128), Human cell cycle 

regulatory protein (ElA-binding protein) p300 gene (seq id 
nos 212-219), Human tranforming growth factor-beta 
inducted gene product (BIGH3) (seq id nos 228-233), Human 
FLt4 gene (for transmembrane tyrosinase kinase (seq id nos 

25 281-286) , Human G protein-coupled receptor (hGPRl) gene 

(seq id nos 317-322), Human transcription factor . (hITF-2 ) 
gene (seq id nos 329-330) , Human telomerase-associated 
protein TP-1 (hTP-l : ). gene (seq id nos 338-351), Human 
transcription factor TFIIB 90 kDa subunit (hTFBIIB90 ) gene 

30 (seq id nos 366-373), Human FADD-homologous ICE/CED-3 like 
protease gene (seq id nos 129-133) 

Mutations in DNA synthesis or -repair enzymes may also 
lead to uncontrolled cell growth. Human DNA topoisomerase 
35 II (top2) gene (seq id nos 103-108) and Human putative 

mismatch repair /binding protein (hMSH3) gene (seq id nos 
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134-147) and (hMSH6) gene (seq id nos 201-204 and 
205-299) . 

The following are tumour suppressor genes, Human 
5 retinoblastoma binding protein 1 isoform I (hRBPl) gene 
(seq id hos 148-158), Human neurof ibromin (NF1 ) gene (seq 
id nos 178-183), Human p53 associated gene (seq id nos 
287-294), Human retinoblastoma related protein (pl07) gene 
(seq id nos 312-316), Human tumour suppressor (hLUCA-1) 
10 gene (seq id nos 374-381), Mutations in these genes may 
result in development of cancer. 

The following are oncogenes, proto-oncogenes or putative 
oncogenes ; Human germline n-myc gene (seq id nos 

15 184-189), Human n-myc gene (seq id nos 190-195), Human can 
(hCAN) gene (seq id nos 300-302), Human dek (hDEK) gene 
(seq id nos 309-311), b-raf oncogene (seq id nos 172-177), 
Human DBL (hDBL) proto-oncogene / Human MCF2PO (hMCF2PO) 
gene (seq id nos 303-308) . Frameshift mutations in these 

20 genes may lead to development of cancer. 



BIOLOGICAL EXPERIMENTS 

25 Description of the Figures 
FIG. 1: 

It has been demonstrated that T cells from normal donors 
can be stimulated with a mixture of peptides containing 

30 both mutant BAX and mutant TGFpRII peptides. Peptide 

mixture dependent T cell proliferation in blood samples 
from six different donors are shown in figure 1. The 
results were obtained by stimulating peripheral blood 
mononuclear cells (PBMCs)from each donor with a mixture of 

35 mutant BAX peptides (seq id nos 1,9-12) and mutant TGFfiRII 
peptides (seq id nos 15-21) . The concentration of each 
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individual peptide in the mixture was 2 0 fiM. After two 
weeks, and weekly thereafter, the bulk cultures were 
restimulated with autologous PBMCs pulsed with 10—25 |XM of 
the peptide mixture. After 4-5 restimulations the bulk 
5 cultures were tested in a standard proliferation assay with 
PBMCs alone or as a control or PBMCs pulsed with 2 5 |UM of 
the peptides as antigen presenting cells (APCs) . 

FIG . 2: 

10 It has further been found that T cell clones can be 

generated against separate peptides of the mixture used in 
the bulk stimulation experiments. Figure 2 shows the 
proliferation of T cell clone 521-2 which was obtained by 
cloning the bulk culture from donor 1 (figure 1) by seeding 

15 5 cells per well in U-bottomed, 96-well microtiter plates 

and using autologous PBMCs pulsed with 25 |XM of the mutant 
BAX peptide with seq id no 12 as feeder cells. Autologous 
B-lymphoblastoid cells were used as APCs in the 
proliferation assay. 

20 

FIG , 3 : 

In figure three it is shown that mutant BAX peptides and 
mutant TGFfiRII peptides can be used to stimulate T cells 
(PBMCs) from a patient with breast cancer. Dendritic cells 

25 (DCs) from the same cancer patient were used as APCs. The T 
cell stimulation (figure 3) was obtained by pulsing DCs 
separately with a mixture of mutant BAX peptides (seq id 
nos 1,9-12) and a mixture of mutant TGF{3RII peptides (seq 
id nos 15-21) followed by addition of autologous PBMCs and 

30 10 ng/ml tumour necrosis factor. The concentration of each 
peptide in the mixtures used for pulsing was 25 ^iM. The 
PBMCs and the DCs were obtained by leukapheresis from a 
patient with breast cancer who had been on a granulocyte 
colony stimulating factor (G-CSF) treatment. The CD34+ 
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cells were isolated from the cell product before DCs were 
derived using standard methods. 



FIG. 4: 



5 Figure 4 shows the capability of T cells obtained from 

ascites fluid of a pancreatic cancer patient to recognise 
and proliferate to different synthetic peptides derived 
from mutant BAX (seq id nos 1,9-12) and mutant TGF0RII (seq 
id nos 15,17-21). The T cell line was obtained after 

10 expansion of T cells present in the ascites fluid of a 

patient with pancreatic adenocarcinoma. The T cell line was 
expanded in vitro by culturing with 100 U/ml recombinant 
interleukin-2 (rIL-2) (Amersham, Aylesbury, UK) for one 
week before beeing tested in a proliferation assay. 



Autologous, irradiated (3 0Gy) PBMCs were seeded 5 x 104 in 
u-bottomed 96-well plates (Costar, Cambridge, MA) and 
pulsed with single synthetic peptides at 20 (iM for 2h. The 
T cells were added 5 x 104 per well and the plates were 

20 incubated for four days at 37°C with addition of 18.5 x 104 
Bq/mL 3H- thymidine for the last 12 hours before harvesting. 
The plates were counted in a liquid scintillation counter 
(Packard Topcount) . Data represent specific proliferation 
to the different synthetic peptides and values are 

25 expressed as the mean of triplicate cultures. These results 
show that T cells isolated from a pancreatic cancer patient 
are capable of responding to a panel of peptides carrying 
amino acid sequences derived from mutant BAX and TGFpRII . 

30 FIG. 5: 

Figure 5 further demonstrates the capability T cells from 
another pancreatic cancer patient to recognise and 
proliferate to different synthetic peptides derived from 
mutant BAX and mutant TGF(3RII. The T cell line was obtained 
35 after expansion of T cells present in the ascites fluid of 



15 
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a patient with pancreatic adencarcinoma. The experiment was 
set up in the same way as described above. Data represent 
specific proliferation to the different synthetic peptides 
and values are expressed as the mean of triplicate 
5 cultures. 

In order to investigate the T cell response from the latter 
pancreatic cancer patient, responding T cells were cloned. 
Peritoneal macrophages were irradiated (30 Gy) and plated 1 

10 x 104 into U-bottomed 96-well plates (Costar) together with 
25 (IM of each peptide. T cell blasts were counted in a 
microscope and added 5 blasts per well together with 100 
U/ml human recombinant interleukin-2 (rIL-2) (Amersham, 
Aylesbury, UK) in a total volume of 200 rnL . After 14 days T 

15 cell clones were transferred onto 24-well plates (Costar) 
with 1 mg/mL phytohemagglutinin (PHA, Wellcome, Dart ford, 
UK), 100 U/ml rIL-2 and allogeneic, irradiated PBMCs as 
feeder cells and screened for peptide specificity after 7 
and 1 4 days . 

20 

FIG. 6: 

T cell clone 520.5, 520.7 and 520.8 were selected for 
further characterisation and express the cell surface 
phenotype CD3+, CD8+ and TcR+ . Figure 6 shows the 

25 recognition and cytotoxicity of T cell clone 520.5, 520.7 
and 520.8 against peptide-pulsed autologous target cells 
pulsed with the seq id no 10 peptide. Autologous 
Epstein-barr virus transformed B-cells (EBV) were labelled 
with 3H- thymidine (9.25 x 104 Bq/ml) over night, washed 

30 once and plated 2500 cells per well in 96-well plates with 
or without 25 mM of synthetic peptide (seq id no 10) and 1% 
DMSO in medium. After 3 0 minutes incubation at 37°C the 
plates were washed before addition of T cells. The plates 
were further incubated at 37°C for 4 hours and then 

35 harvested before counting in a liquid scintillation counter 
(Packard Topcount) . Data represent percent specific lysis 
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of 3H- thymidine labelled peptide pulsed target cells at an 
effector/target ratio of 10/1. Values are expressed as the 
mean of triplicate cultures. These results demonstrate that 
the three different T cell clones obtained from ascites 
fluid of a pancreatic carcinoma patient, exhibit specific 
cytotoxicity of autologous EBV targets pulsed with the 
relevant peptide (seq id no 10) derived from mutant BAX. 



FIG . 7: 

10 Figure 7 shows the cytolytic properties of three different 
T cell clones obtained from the same patient. These T cell 
clones were cultured and expanded as described above, but 
they were generated against a synthetic peptide the seq id 
no 17 peptide carrying amino acid sequences derived from 

15 mutant TGFpRII . T cell clone 538.1, 538.3 and 538.4 all 
show the cell -surface phenotype CD3 + , CD8 + and TcR+. The 
experimental conditions were as described above (figure 6) . 
Data represent percent specific lysis of 3H-thymidine 
labelled peptide pulsed target cells pulsed with the seq id 

20 no 428 peptide at an effector/ target ratio of 10/1. Values 
are expressed as the mean of triplicate cultures. These 
results demonstrate that the three different T cell clones 
obtained from ascites fluid of a pancreatic carcinoma 
patient, exhibit specific cytotoxicity of autologous EBV 

25 targets pulsed with the relevant peptide (seq id no 428) 
derived from mutant TGFPRII. 

FIG. 8: 

Figure 8 shows the specificity of two CD4+ T cell clones, 
30 IMT8 and IMT9 , obtained from a tumour biopsy taken from a 
patient with an adenocarcinoma localised to the proximal 
colon. Immunohi s t ochemi s try revealed that the patient had 
an abundant infiltrate of predominantly CD4+ T cells, many 
of which carried activation markers. In areas of CD4 T cell 
35 infiltration islands of HLA DR positive tumour cells were 
observed. The T cell clones were obtained from the 
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component of tumour infiltrating lymphocytes which grew out 
of the biopsy following culture in medium containing 15 
U/ml of recombinant human IL.-2 for 16 days. The T cells 
from this culture were cloned by limiting dilution (1 
cells/well) in Terasaki plates with irradiated peptide 
pulsed APC and 100 U/ml of IL-2 . Pulsing of autologous APC 
was performed with a mixture of the TGF(3RII frameshift 
peptides with sequence identity no. 15, 17 and 18 at 1 
jxg/ml of each peptide in the presence of 3 jug/ml of 
purified human p2 microglobulin and lOng/ml of recombinant 
human TNFCX for 3 hrs at 37 °C . Of the 14 clones that could 
be expanded preliminary tests showed that two of the clones 
were reactive with the peptide mixture used for cloning. 
After expansion the clones were screened for reactivity 
with the single peptides in a standard proliferative assay. 
The results show that IMT8 and IMT9 both react specifically 
with the TGF0RII frameshift peptide with seq. id. no. 17, 
no reactivity was observed with the two other frameshift 
peptides tested. 

The figure (Fig. 8) depicts the results of conventional T 
cell proliferative assays, where cloned T cells (5x10') and 
irradiated APC (5xl0 4 ) were cocultured for 3 days in 
triplicates before harvesting. To measure the proliferative 
capacity of the cultures, 3 H- thymidine (3,7xl0 4 Bq/well) 
was added to the culture overnight before harvesting) 
Values are given as mean counts- per minute (cpm) of the 
triplicates . 

FIG. 9: 

Figure 9 demonstrates that the specific reactivity of the 
two T cell clones IMT8 and IMT9 against the peptide with 
.seq. id.no. 17 is completely blocked by treatment of the 
cells with an antibody that specifically binds to HLA-DR 
molecules, since the reactivity after blocking is the same 
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as the background reactivity of the clones with APC in the 
absence of the peptide. On the other hand antibodies to the 
HLA class II isotypes HLA-DQ and -DP failed to block the 
reactivity of the clones with peptide pulsed APC. This 
experiment unequivocally identifies HLA-DR as the molecule 
responsible to present the peptide to these two T cell 
clones. Antibody blocking experiments were performed using 
the homozygous EBV transformed cell line 9061 (IHWS9 
nomenclature) as APC. The APC were pulsed with peptide at a 
concentration of 15 |ag/ml for 1 hr at 37 °C before addition 
of blocking antibodies L2 43 (pan-DR antibody) , SPVL3 
(pan-DQ antibody) and B7.21 (pan-DP antibody) at 10 |ig/ml. 
Unpulsed APC and APC pulsed with peptide in the absence of 
blocking antibody served as negative and positive controls 
respectively. Results are expressed as in Figure 8. 

FIG. 10: 

The patient IMT was HLA typed and turned out to be HLA: 
Al,2; B7,8; DR3,14; DQ1 , 2 . To determine which of the HLA-DR 
molecules that were responsible for presentation of the 
peptide with seq. id. no. 17, a panel of HLA workshop 
derived homozygous BCL cell lines were obtained and pulsed 
with the peptide with seq. id. no. 17. Figure 10 describes 
the identification of HLA-DR14 <DRA*0102 / DRB*1401) as the 
HLA-DR molecule responsible for presentation of the peptide 
with seq. id. no. 17 to the T cell clones IMT8 and IMT9 . A 
specific proliferative response was observed when peptide 
was presented by the autologous EBV transformed cell line 
(Auto APC) and by cell lines 9054 <EK) and 9061 (31227ABO), 
both of which expressed DR14 as the only DR molecule on 
their surface. The homozygous cell line gave higher 
responses, reflecting a higher level of expression of the 
relevant class II/peptide complexes due to the effect of a 
double dose of the genes encoding this DR molecule. No 
response was obtained when the peptide was presented by 
cell lines expressing HLA-DR 3 (9018, LOQ81785) , which 
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represents the other DR molecule expressed by the patients 
APC, nor by irrelevant HLA-DR molecules. The experiment was 
performed as described in figure 9, with the exception that 
no antibody blocking was performed. Results are expressed 
5 as in Figure 8. 

FIG. 11; 

Figure 11 describes the dose response curves obtained by 
pulsing the cell line 9054 with increasing concentrations 

10 of the peptide with seq. id. no. 17. Both IMT 8 and IMT9 

demonstrate a dose dependent increase in the proliferative 
response to the peptide. Results were performed as 
described in Fig. 9 and 10 with the peptide concentrations 
indicated on the Figure (Fig. 11). Results are expressed as 

15 in Fig. 8. 

FIG. 12: 

Figure 12 describes the reactivity of a cell line generated 
by in vitro stimulation of . T cells isolated from peripheral 

20 blood from a healthy blood donor (Donor 2892) by weekly 
stimulation with irradiated autologous dendritic cells 
pulsed with the peptides with sequence identity numbers 16, 
17 and 21. A specific response above background values was 
obtained when the T cells were co-incubated with autologous 

25 dendritic cells pulsed with the peptide with seq. id. no. 

21. No activity could be detected in the culture after the 
first and second in vitro stimulation. These data 
demonstrate that the T cell repertoire of normal 
individuals contain a few precursor cells that have the 

30 capacity to recognise this frameshift peptide derived from 
a mutation in TGF^RII that does not occur in normal people. 
In two other blood donors (#2706 and #2896), the level of 
precursor cells with the relevant specificity was too low 
to be detected. The results are expressed as spots per 10 4 

35 T cells tested in a conventional IFNg ELISPOT assay. This 
assay enumerates the number of cells present in a mixture 
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of cells that are capable of specifically reacting with a 
defined antigen. Briefly 10 7 T cells (non adherent cells) 
were stimulated weekly with 2-5xl0 6 irradiated peptide 
pulsed autologous dendritic cells (DC) as APC. The DC were 
generated from the adherent cell population by culture for 
one week in recombinant human GM-CSF and IL-4 according to 
standard protocols as described in the literature. After 
peptide pulsing overnight at 15 \xg/ml of peptide, full 
maturation of the DC was obtained by culture with 
recombinant TNFec. ELISPOT was performed according to 
standard published protocols using 10 4 cultured T cells per 
well in duplicate and 10 4 peptide pulsed or unpulsed DC as 
APC. The results are expressed as mean number of spots per 
10* T cells. 

FIG. 13: 

Figure 13 shows the results of in vitro stimulation of T 
cells from a healthy blood donor (Donor 322) with peptides 
with sequence identity number 15-21. In vitro culture was 
performed as described in Figure 12. A proliferative 
response above background values was seen when the T cell 
culture primed with a mixture of the peptides with seq. 
id. no. 16 and 21 was stimulated with peptide 21 and the 
culture primed with the peptide with seq. id. no. 17 was 
stimulated with the same peptide. These results demonstrate 
that normal blood donors have small numbers of circulating 
T cells specific for these frameshift peptides, and that it 
is possible to expand these cells in culture by stimulation 
with frameshift peptides. These results also confirmed the 
results shown in Fig. 8-11, demonstrating that the peptide 
with seq. id. no. 17 is immunogenic in humans, and indicate 
that the peptide with seq. id. no. 21 may also be used as a 
cancer vaccine in humans. The results are expressed as 
described in Fig. 8. 
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FIG. 14: 

The results shown in Figure 14 demonstrate that CD8+ T 
cells specific for HLA class I epitopes can be generated 
from T cells present in the T cell repertoire of a healthy 
5 blood donor (donor 905) . No reactivity above background was 
seen with any of the peptides after the second round of in 
vitro restimulation. After the fourth restimulation, the 
frequency of T cells specific for the peptide with seq. id. 
no. 42 8 had increased from undetectable levels to 

10 approximately 2,5 % of the cells. These results demonstrate 
that CTL precursors of the CD8+ phenotype are present in 
the unprimed T cell repertoire of healthy blood donors. 
Such T cells may be expanded in vitro by specific 
stimulation with the peptide with seq. id. no. 428. This 

15 forms the basis for using this peptide as a cancer vaccine 
to elicit cytotoxic T cells specific for framshift peptides 
in cancer patient having such mutations . T cells were 
generated by weekly restimulation of T cells isolated from 
peripheral blood and stimulated with peptide pulsed 

20 autologous DC as described in Fig. 12 , with the exception 
that 11-7 and 11-2 was added during culture according to 
standard procedures for generating cytotoxic T cells (CTL) 
of the CD8 phenotype. The peptides used were peptides with 
sequence identity number 428, 439, 446 and 451. Cells were 

25 tested in ELISPOT assay as described in Fig. 12. The results 
are expressed as described in figure 12. 



30 The peptide with seq. id. no. 17 was selected and designed 
to contain binding motifs for both several HLA class I and 
HLA class II molecules. These peptides thus contains 
epitopes both for CD4+ and CD8+ T cells, and was predicted 
to elicit both CD4 and CD8 T cell responses in cancer 

35 patient provided processing of the aberrant TGFfiRII protein 
naturally occurring in cancer cells would take place and 
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result in an overlapping peptide. This has now been proven 
for CD4 T cells by the results in Fig. 8-11. These results 
have the following implication: 

1) The results in Figure 8 prove that the mutated form of 
TGF0RII Receptor which occurs in a high proportion of 
cancer patients with defects in their mismatch repair 
machinery is a tumour specific antigen. 



10 



2) The antigen specificity of the infiltrating T cells 
commonly observed in colorectal cancer are generally not 
known. The results in figure 8 demonstrate that one 
component of the T cells constituting the population of 
tumour infiltrating lymphocytes in this patients tumour is 
15 specific for a frameshift mutation, demonstrating that 
TGFPRII frameshift peptides are immunogenic in vivo, 
occasionally giving rise to spontaneous T cell activation. 



20 



25 



30 



35 



3 )It follows from this observation that processing of the 
non- functional form of the TGF0RII Receptor that is formed 
by the common frameshift mutation is processed. This 
processing may take place either in the tumour cell as part 
of natural breakdown of the aberrant protein, or after the 
tumour cell itself or a released form of the receptor has 
been taken up by a professional APC or both. 

4) The results in Figure 8 also indicate that the peptide 
with seq. id. no. 17 is capable of binding to an HLA class 
II molecule, since pulsing of APC with this peptide results 
in a specific proliferative response against the peptide, 
and since CD4 T cell responses always are class II 
restricted. That this is the case is demonstrated by the 
results of the experiment shown in Figure 9. Here it is 
shown that the specific response against the peptide with 
seq. id. no. 17 is completely blocked by an antibody to 
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HLA-DR, but not with antibodies to the two other HLA class 
II molecules, HLA-DQ and -DP. Furthermore, by using a panel 
of standard homozygous Epstein Barr Virus (EBV) transformed 
B Cell Lines (BCD covering the relevant HLA class II 

5 molecules present on the patients own APC, we were able to 
identify the class II molecule responsible for presentation 
of the peptide with seq. id. no. 17 to TLC IMT8 and IMT9 as 
being HLA-DR 14. Together these findings fit extremely 
well with the immunohis tological observations made in 

10 parallel sections taken from the same tumour biopsy, where 
we could show that activated CD4+ T cells were abundant in 
the proximity of tumour cells that had been induced to 
express HLA-DR. molecules. The results in Figure 11 
demonstrate that these T cell clones are capable of 

15 mounting a proliferative response over a range of peptide 
doses and that the responses are dose dependent . 

5) Since these T cell clones were obtained by cloning T 
cells isolated from a tumour biopsy, another implication of 

20 our finding is that activated T cells specific for the 

peptide with seq. id. no. 17 are capable of homing to the 
tumour tissue after activation. 

6) Since the peptide with seq. id. no. 17 is a tumour 
25 specific antigen, and since frameshift mutations giving 

rise to this peptide or peptides with overlapping sequences 
are commonly found in cancers with defects in enzymes that 
are part of the mismatch repair machinery, this peptide may 
be used as a vaccine to elicit T cell response in cancer 

30 patients or patients at high risk for developing cancer. 

Such T cell responses may potentially influence the growth 
of an existing tumour or prohibit regrowth of tumour after 
surgery and other forms of treatment or be given to 
patients with an inheritable form of cancer where a defect 

35 mismatch enzyme is detected or suspected and that have a 
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high chance of developing a cancer where this precise 
mismatch repair mutation will occur. 



5 

Synthesis 

The peptides were synthesised by using continuous flow 
solid phase peptide synthesis. N-a-Fmoc -amino acids with 

0 appropriate side chain protection were used. The 
Fmoc-amino acids were activated for coupling as 
pentaf luorophenyl esters or by using either TBTU or 
diisopropyl carbodiimide activation prior to coupling. 
2 0% piperidine in DMF was used for selective removal of 

5 Fmoc after each coupling. Cleavage from the resin and 

final removal of side chain protection was performed by 
95% TFA containing appropriate scavengers. The peptides 
were purified and analysed by reversed phase (C18) HPLC. 
The identity of the peptides was confirmed by using 

0 electro-spray mass spectroscopy (Finnigan mat SSQ710) . 

The peptides used for in vitro studies of T cell 
stimulation were synthesised by this method. 

5 Several other well known methods can be applied by a person 
skilled in the art to synthesise the peptides. 



Examples of the method for determining new frameshift 
30 mutation peptides . 

In this Example, the BAX gene is used to illustrate the 
principle . 

35 In each of the steps listed below, the 1st line is the gene 
sequence and 2nd line is amino acid sequence. 
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In the steps 2-5, the outlined sequences represent the 
mutant part of the protein. 



5 Step one: 

Normal BAX . 

A TG GGG GGG GA G GCA CCC GAG CTG GCC CTG GAC CCG GTG 

10 MGGEAPELALDPV... 



Step two: 

15 

1G deleted from gene sequence. 

ATG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC CTC 

MGG RHPSWPWTRCI- 

20 

AGG ATG CGT CCA CCA AGA AGC TGA 
R M R P P R S stop 

25 

Step three: 

2G deleted from gene sequence. 

30 

ATG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 
MG GGTRAGPGPGA 

TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
35 SGCVHQEAERVS QA 



CAT CGG GGA CGA ACT GGA CAG TAA 
H R G. R T G Q stop 

40 
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Step four: 

1G inserted in gene sequence. 

ATG GGG GGG GGA GGC ACC CGA OCT GGC CCT GGA CCC GGT GCC 
MGGGGTRAGPGPGA 

TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 
SGCVHQEAERVSQA 

CAT CGG GGA CGA ACT GGA CAG TAA 
H R G R T G Q stop 



Step five: 

2G inserted in gene sequence. 

ATG GGG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC 
MGGGRHPSWPWTRC 

CTC AGG ATG CGT CCA CCA AGA AGC TGA 
L R M R P PRS Stop 



In the next Example, the TGFpRll gene is used to illustrate 
the principle. 

In each of the steps listed below, the 1st line is the gene 
sequence and 2nd line is amino acid sequence. 
In the steps 2-5, the outlined sequences represent the 
mutant part of the protein. 



35 



40 



45 



Step one: 
Normal TGF(3rii. 

GAA AAA AAA AA G CCT GGT GAG ACT TTC TTC ATG TGT TCC. . 
EKKKPGETFF MCS. 
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Step two : 



1A deleted from gene sequence. 

5 GAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT GTA 

EKKSLVRLS S CVPV 



GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG AAT 
ALMSAMTTSSSQKN 

ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 
ITPAILTCC stop 



15 

Step three: 

2A deleted from gene sequence. 

20 GAA AAA AAA GCC TGG TGA 

E K K A W stop 

25 Step four: 

1A inserted in gene sequence. 

GAA AAA AAA AAA GCC TGG TGA 
30 E K K K A W stop 



35 



40 



Step five: 

2A inserted in gene sequence. 

GAA AAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT 
EKKKSLVRLSSC V P 

GTA GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG 
VAIiMSAMTTSSSQK 



AAT ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 
45 NITPAILTCC stop 



Thus the peptides of the invention may be used in a 
50 method for the treatment of cancers with cancer cells 
harbouring genes with frameshift mutations, which 
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treatment comprises administering at least one peptide of 
the present invention in vivo or e^c vivo to a human 
patient in need of such treatment. 

5 In another embodiment the peptides of the invention may 
be used to vaccinate a human being disposed for cancers 
with cancer cells harbouring genes with frameshift 
mutations, by administering at least one peptide of the 
present invention to said human being. 



It is further considered to be an advantage to administer 
to a human individual a mixture of the peptides of this 
invention, whereby each of the peptides of the invention 
can bind to different types of HLA class I and/or class 
15 II molecules of the individual. 

It is further anticipated that the power of an anticancer 
vaccine or peptide drug as disclosed in the above 
mentioned PCT/N092 / 00032 application, can be greatly 

20 enhanced if the peptides of the present invention were 
included. Thus in another embodiment of the present 
invention peptides of the present invention are 
administered together with, either simultaneously or in 
optional sequence, with the peptides disclosed in 

25 PCT/NO92/00032. 

It is considered that the peptides may be administered 
together, either simultaneously or separately, with 
compounds such as cytokines and/or growth factors, i.e. 
30 interleukin-2 (IL-2), interleukin-12 (IL-12), granulocyte 
macrophage colony stimulating factor (GM-CSF) , Flt-3 
ligand or the like in order to strengthen the immune 
response as known in the art. 

35 The peptides according to the present invention can be 
used in a vaccine or a therapeutical composition either 



10 
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alone or in combination with other materials, such as for 
instance standard adjuvants or in the form of a 
lipopeptide conjugate which as known in the art can 
induce high-affinity cytotoxic T lymphocytes, (K. Deres , 
5 Nature, Vol.342, (nov.1989)). 

The peptides according to the present invention may be 
useful to include in either a peptide or recombinant 
fragment based vaccine. 

10 

The peptides according to the present invention can be 
included in pharmaceutical compositions or in vaccines 
together with usual additives, diluents, stabilisers or 
the like as known in the art. 

15 

According to this invention, a pharmaceutical composition 
or vaccine may include the peptides alone or in 
combination with at least one pharmaceutically acceptable 
carrier or diluent. 

20 

Further a vaccine or therapeutical composition can 
comprise a selection of peptides which are fragments of 
the mutant proteins arising from insertion or deletion of 
bases in a repeat sequence of the gene. 

25 

Further a vaccine composition can comprise at least one 
peptide 'selected for one cancer, which vaccine would be 
administered to a person carrying a genetic disposition 
for this particular cancer. 

30 

Further a vaccine composition can comprise at least one 
peptide selected for one cancer, which vaccine would be 
administered to a person belonging to a high risk group 
for this particular cancer. 

35 
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The cancer vaccine according to this invention may 
further be administered to the population in general for 
example as a mixture of peptides giving rise to T cell 
immunity against various common cancers connected with 
frameshift mutation genes. 

The peptides according to this invention may be 
administered as single peptides or as a mixture of 
peptides. Alternatively the peptides may be covalently 
linked with each other to form larger polypeptides or 
even cyclic polypeptides. 

A cancer therapy according to the present invention may 
be administered both in vivo or ex vivo having as the 
main goal the raising of specific T cell lines or clones 
against the mutant gene product associated with the 
cancer type with which the patient is afflicted. 

Further, the frameshift mutant peptides of this invention 
may be administered to a patient by various routes 
including but not limited to subcutaneous, intramuscular, 
intradermal, intraperitoneal, intravenous or the like. In 
one embodiment the peptides of this invention are 
administered intradermal ly . The peptides may be 
administered at single or multiple injection sites to a 
patient in a therapeutically or prophylactically 
effective amount. 

The peptides of this invention may be administered only 
once or alternatively several times, for instance once a 
week over a period of 1-2 months with a repeated sequence 
later all according to the need of the patient being 
treated . 

The peptides of this invention can be administered in an 
amount in the range of 1 microgram (1 fxg) to 1 gram (lg) 
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to an average human patient or individual to be 
vaccinated. It is preferred to use a smaller dose in the 
rage of 1 microgram (1 \ig) to 1 milligram (1 mg) for each 
administration. 

5 

The invention further encompasses DNA sequences which 
encodes a frameshift mutation peptide. 

The invention additionally encompasses isolated DNA 
10 sequences comprising a DNA sequence encoding at least one 
frameshift mutant peptide, and administration of such 
isolated DNA sequences as a vaccine for treatment or 
prophylaxis of cancers associated with frameshift 
mutations in the genes. 

15 

The peptides according to this invention may be 
administered to an individual in the form of DNA vaccines. 
The DNA encoding these peptides may be in the form of 
cloned plasmid DNA or synthetic oligonucleotide. The DNA 

20 may be delivered together with cytokines, such as IL-2 , 
and/or other co-stimulatory molecules. The cytokines 
and/or co-stimulatory molecules may themselves be delivered 
in the form of plasmid or oligonucleotide DNA. The response 
to a DNA vaccine has been shown to be increased by the 

25 presence of immunostimulatory DNA sequences (ISS). These 

can take the form of hexameric motifs containing methylated 
CpG, according to the formula: 

5 ' -purine-purine-CG-pyrimidine-pyrimidine-2 1 « Our DNA 
vaccines may therefore incorporate these or other ISS, in 
30 the DNA encoding the peptides, in the DNA encoding the 

cytokine or other co-stimulatory molecules, or in both. A 
review of the advantages of DNA vaccination is provided by 
Tighe et al (1998, Immunology Today, 19 (2), 89-97). 

35 In one embodiment, the DNA sequence encoding the mutant 
BAX peptides comprises: 
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Normal BAX. 

5 

ATG_GGG_GGG_GAG GGA CCC GAG GTG GCG GTG GAC GCG GTG 

10 1G deleted from BAX gene sequence. 

ATG GGG GGG AGG CAC CCG AGC TGG CCC TGG ACC CGG TGC CTC 
AGG ATG CGT CCA CCA AGA AGC TGA 

15 

2G deleted from BAX gene sequence. 
20 ATG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 

TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 

CAT CGG GGA CGA ACT GGA CAG TAA 

25 

1G inserted in BAX gene sequence. 
30 ATG GGG GGG GGA GGC ACC CGA GCT GGC CCT GGA CCC GGT GCC 

TCA GGA TGC GTC CAC CAA GAA GCT GAG CGA GTG TCT CAA GCG 

CAT CGG GGA CGA ACT GGA CAG TAA 

35 

2G inserted in BAX gene sequence. 
40 ATG GGG GGG GGG ZZ! CAC CCG AGC TGG CCC TGG ACC CGG TGC 

CTC AGG ATG CGT CCA CCA AGA AGC TGA 

45 

In a second embodiment, the DNA sequence encoding the 
mutant TGF(3RII peptides comprises: 

50 
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Normal TGFpRII gene. 

GAA AAA AAA AA G CCT GGT GAG ACT TTC TTC ATG TGT TCC . . . . 

5 



1A deleted from TGF|3RII gene sequence. 

GAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT GTA 

10 

GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG AAT 
ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 



15 



2A deleted from TGFpRII gene sequence. 
GAA AAA AAA GCC TGG TGA 

20 



1A inserted in TGFfiRII gene sequence. 

25 

GAA AAA AAA AAA GCC TGG TGA 



30 

2A inserted in TGF(3RII gene sequence. 

GAA AAA AAA AAA A GC CTG GTG AGA CTT TCT TCA TGT GTT CCT 
35 GTA GCT CTG ATG AGT GCA ATG ACA ACA TCA TCT TCT CAG AAG 

AAT ATA ACA CCA GCA ATC CTG ACT TGT TGC TAG 

40 

The invention further encompasses vectors and plasmids 
comprising a DNA sequence encoding a frameshift mutant 
peptide. The vectors include, but are not limited to 
E.Coli plasmid, a Listeria vector and recombinant viral 
45 vectors. Recombinant viral vectors include, but are not 
limited to orthopox virus, canary virus, capripox virus, 
suipox virus, vaccinia, baculovirus, human adenovirus, 
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SV40, bovine papilloma virus and the like comprising the 
DNA sequence encoding a frameshift mutant peptide. 

It is considered that an anticancer treatment or 
prophylaxis may be achieved also through the 
administration of an effective amount of a recombinant 
virus vector or plasmid comprising at least one insertion 
site containing a DNA sequence encoding a frameshift 
mutant peptide to a patient, whereby the patient's 
antigen presenting cells are turned into host cells for 
the vector /plasmid and presemtation of HLA/ frameshift 
mutation peptide complex is achieved. 

A person skilled in the art will find other possible use 
combinations with the peptides of this invention, and 
these are meant to be encompassed by the present claim. 

The peptides according to this invention may be produced 
by conventional processes as known in the art, such as 
chemical peptide synthesis, recombinant DNA technology or 
protease cleavage of a protein or peptide encoded by a 
frameshift mutated gene. One method for chemical 
synthesis is elucidated in the description below. 

In order for a cancer vaccine and methods for specific 
cancer therapy based on specific T cell immunity to be 
effective, three conditions must be met: 
1. The peptides used must correspond, either in their 
full length or after processing by antigen presenting 
cells, to the processed mutant protein fragment as 
presented by a HLA Class I and/or class II molecule on 
the cancer cell or other antigen presenting cells, 
2 . The peptides used must be bound to a HLA Class I 
and/or Class II molecule in an immunogenic form, and 
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3. T-cells capable of recognising and responding to the 
HLA/peptide complex must be present in the circulation of 
the human being. 

5 It has been established that all these conditions are met 
for some representative peptides according to the present 
invention. The peptides according to the present 
invention give rise to specific T cell immune responses 
in vitro. It has been established that the peptides 
10 according to this invention correspond to processed 
mutant protein fragments. This is exemplified with 
peptides corresponding to fragments of transformed mutant 
BAX and TGF(3RII peptides. 

15 Through the present invention the following advantages 
are achieved: 

- It offers a possibility to treat patients suffering 
from cancers arising from frame-shift mutations in their 
genes, most of which cancers known at present do not have 

20 any good treatment alternatives. 

- It offers a possibility to vaccinate prophylaxtically 
humans carrying genetic dispositions or belonging to 
other high risk groups. 

- It offers a possibility to prepare a combination 

25 treatment for a specific cancer, such as for instance 
colorectal or pancreatic cancers, wherein the cancer 
commonly is associated with either a frameshift mutation 
or a point mutation in the genes. 

-Since described frameshift mutations occurs in a large 
30 variety of cancers it will be possible to use this 

peptides in combination with established vaccines and 
future vaccines to obtain a multiple targetting 
treatment . 

-Likewise patients suffering from cancers associated with 
35 multiple frameshift mutations in genes can be treated 
more efficiently through a combination treatment. 
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Claims 

1. A peptide characterised i 



PCT/NO99/00143 



10 



20 



n that it 



a) is at least 8 amino acids long and is a 
fragment of a mutant protein arising from a 
frameshift mutation in a gene of a cancer cell; 



and 



b) consists of at least one amino acid of the 
mutant part of a protein sequence encoded by said 



gene ; 



15 and 



c) comprises 0-10 amino acids from the carboxyl 
terminus of the normal part of the protein 
sequence preceding the amino terminus of the 
mutant sequence and may further extend to the 
carboxyl terminus of the mutant part of the 
protein as determined by a new stop codon 
generated by the frameshift mutation; 

25 and 

d) induces, either in its full length or after 
processing by anti', • presenting cell, T cell 
responses . 

30 

2. A peptide according to claim 1 characterised in that it 
contain 8-25 amino acids. 

3. A peptide according to claim 1 characterised in that it 
35 contain 9-20 amino acids. 



BNSOOCID: <WO 9958552A2 I > 



WO 99/58552 %M PCT/NO99/00143 

67 

4. A peptide according to claim 1 characterised in that it 
contain 9-16 amino acids. 

5. A peptide according to claim 1 characterised in that it 
5 contain 8-12 amino acids. 

6. A peptide according to claim 1 characterised in that it 
contain 20-25 amino acids. 

10 7. A peptide according to claim 1 characterised in that it 
contains 9 amino acids. 

8. A peptide according to claim 1 characterised in that it 
contains 12 amino acids . 

15 

9. A peptide according to claim 1 characterised in that it 
contains 13 amino acids. 

10. A peptide according to claim 1 characterised in that it 
20 is a fragment of a mutant protein encoded by a frameshift 

mutation in BAX gene or TGFj3RII gene. 



25 



30 



35 
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11. A peptide according to claim 1 characterised in that it 
is a fragment of a mutant protein encoded by a frameshift 
mutation in hTGF02 gene, DCC gene, BRCA1 gene, BRCA2 gene, 
hPTP gene, top2 gene, TTK gene, CTCF gene, Human 
FADD-homologous ICE/CED-3-like protease gene, hMSH3 gene, 
hRBPl gene, hFMRl gene, Human TINUR gene, b-raf oncogene, 
NF1 gene, Human germline n-myc gene, Human n-myc gene, Human 
ras inhibitor gene, hMSH6 gene, Human nasopharynx carcinoma 
EBV BNLF-1 gene, Human cell cycle regulatory protein 
(ElA-binding protein) p300 gene, bcl-3) gene, BIGH3 , Human 
transcription factor ETV1 gene, IGFBP4 gene, Human MUC1 
gene, JAK1 gene, JAK3 gene, Human Flt4 gene. Human p53 ' 
associated gene, hCAN gene, hDBL proto-oncogene/hMCF2PO 
gene, hDEK gene, P 107 gene, hGPRl gene, hRBP56 gene, hITF-2 
gene, hKiSS-1 gene, hTP-1 gene, hFDF-5 gene, hMTAl gene, 
hTFHB90 gene, hLUCA-1 gene, Human Wilm's tumour (WIT-1) 
associated protein, ICErel-IIl gene, FasL gene,BARDl gene, 
hMCF.2 gene, fas gene and Human DPC4 gene. 

20 12. A peptide according to claim 1 characterised in that it 
is selected from a group of peptides having the following 
sequence identity numbers: 

seq. id. nos. 1-21, seq. id. no. 428, seq. id. no. 438 and 
seq. id. nos. 456-458 or a fragment of any of these. 



15 



25 



30 



35 



13. A peptide according to claim 1 characterised in that it 
is selected from a group of peptides having the following 
sequence identity numbers: 

seq. id. nos. 22-427, seq. id., nos. 429-437, seq. id. nos. 
439-455 and seq. id. no. 459 or a fragment of any of these. 

14. A pharmaceutical composition comprising a peptide 
according to any of the above claims and a pharmaceutical^ 
acceptable carrier or diluent. 
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15. A cancer vaccine comprising a peptide according to any 
of the claims 1-13 and a pharmaceutically acceptable 
carrier or diluent. 

5 16. Use of a peptide according to any of the claims 1-13 
for the preparation of a pharmaceutical composition for 
treatment or prophylaxis of cancer. 

17 . Method for vaccination of a person disposed for or 

10 afflicted with cancer, consisting of administering at least 
one peptide according to the claims 1-13, one or more 
times, in an amount sufficient for induction of specific 
T-cell immunity to the mutant proteins or fragments thereof 
encoded by a frameshift mutated gene. 

15 

18 . Method according to claim 17 wherein the amount of the 
peptides is in the range of 1 microgram (1 \xg) to 1 gram 
(lg) and preferentially in the rage of 1 microgram (1 p.g) 
to 1 milligram (1 mg) for each administration. 

20 

19. Method for treatment of a patient afflicted with cancer 
by stimulating in vivo or ex vivo with peptides, according 
to the claims 1-13. 

25 20. Method according to claim 19 wherein the amount of the 
peptides used is in the range of 1 microgram (1 jxg) to 1 
gram (lg) and preferentially in the rage of 1 microgram (1 
[ig) to 1 milligram (1 mg) for each administration. 

30 21. A pharmaceutical composition or vaccine composition 

comprising a combination of at least one peptide according 
to claims 1-13 and at least one peptide according to 
PCT/NO92/00032 . 
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22. A method for identifying new peptides which correspond 
to fragments of proteins arising from frameshift mutations 
in genes, characterised by the following steps: 

1) identifying a gene in a cancer cell susceptible 
to frameshift mutation by having a mono nucleoside 
base repeat sequence of at least five residues, or 
a di -nucleoside base repeat sequence of at least 
four di-nucleoside base units; 

and 



2) removing, respectively, one nucleoside base 
residue or one di-nucleoside base unit from the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered . 
sequence as far as to include a new stop codon; 



and/ or 



3) removing, respectively, two nucleoside base 
residues or two di-nucleoside base units from the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gei 
sequence as far as to include a new stop codon; 

and/ or 



4) inserting, respectively, one nucleoside base 
residue or one di-nucleoside base unit in the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered c 
sequence as far as to include a new stop codon; 



and/or 



WO 99/58552 
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5) inserting, respectively, two nucleoside base 
residues or two di -nucleoside base units in the 
repeat sequence and identifying the amino acid 
sequence of the protein encoded by the altered gene 
5 sequence as far as to include a new stop codon. 



23. A method according to claim 22, 
10 charactersed it that it includes the 
following steps: 



6) determining whether the new peptides, either in 
their full length or as shorter fragments of the 

15 peptides, are able to stimulate T cells; 

and optionally 

7) determining peptides containing nested epitopes 
20 for different major HLA class I and/or HLA class II 

molecules . 



24. An isolated DNA sequence comprising a DNA sequence or 
variants thereof encoding a f rameshif t mutant peptide 

25 according to claim 1. 

25. An isolated DNA sequence encoding peptides comprising 
seq. id. nos . 1-21, seq. id. no. 428, seq. id. no. 438 and 
seq. id. nos. 456-458 or variants thereof. 

30 

26. An isolated DNA sequence encoding peptides comprising 
seq. id. nos. 22-427, seq. id. nos. 429-437, seq. id. nos. 
439-455 and seq. id. no. 459 or variants thereof. 
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27. Use of a DNA sequence according to any of the claims 
24-2 6 for the preparation of a pharmaceutical composition 
for treatment or prophylaxis of cancer. 

28. Method for treatment of a person disposed for or 
afflicted with cancer, by stimulating in vivo or ex vivo 
with DNA sequences according to the claims 24-26. 

29. A plasmid or virus vector comprising the DNA sequence 
of claim 24 encoding a frameshift mutant peptide. 

30. A vector according to claim 29 wherein the vector is 
E.Coli plasmid, a Listeria vector and recombinant viral 
vectors. Recombinant viral vectors include, but are not 
limited to orthopox virus, canary virus, capripox virus, 
suipox virus, vaccinia, baculovirus, human adenovirus, SV40 
or bovine papilloma virus. 

31. Use of a plasmid or virus vector according to claim 29 
for the preparation of a pharmaceutical composition for 
treatment or prophylaxis of cancer. 

32. Method for treatment of a person disposed for or 
afflicted with cancer, by stimulating in vivo or ex vivo 
with plasmids or virus vectors according to claim 29. 
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Sequence iden ti ty list 

SEQUENCE LISTING 

COMMON FOR ALL SEQUENCES. 
SEQUENCE TYPE: Peptide 
SEQUENCE UNIT: Amino Acid 
TOPOLOGY: Linear 



SEQUENCE ID NO: 1 

SEQUENCE LENGTH: 18 amino acids 
RHPSWPWTRCLRMRPPRS 
15 10 15 

SEQUENCE ID NO: 2 

SEQUENCE LENGTH: 31 amino acids 

GTRAGPGPGASGCVHQEAERVSQAHRGRTG 
1 5 10 15 20 25 30 

Q 

SEQUENCE ID NO: 3 

SEQUENCE LENGTH: 32 amino acids 

GGTRAGPGPGASGCVHQEAERVSQAHRGRT 
1 5 10 15 20 25 30 

G Q 

SEQUENCE ID NO: 4 

SEQUENCE LENGTH: 19 amino acids 
GRHPSWPWTRCLRMRPPRS 
15 10 15 
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SEQUENCE ID NO: 5 

SEQUENCE LENGTH: 28 amino acids 

IQDRAGRMGGRHPSWPWTRCLRMRPPRS 
1 5 10 15 20 25 



SEQUENCE ID NO: 6 

SEQUENCE LENGTH: 19 amino acids 
IQDRAGRMGG'GRHPSWPWT 
15 10 15 



SEQUENCE ID NO: 7 

SEQUENCE LENGTH: 42 amino acids 

1 QDRAGRMGGGGTRAGPGPGASGCVHQEAE 
1 5 10 15 20 25 30 

RVSQAHRGRTGQ 
35 40 



SEQUENCE ID NO: 8 

SEQUENCE LENGTH: 19 amino acids 

I QDRAGRMGGGTRAGPGPG 

15 10 15 



SEQUENCE ID NO: 9 

SEQUENCE LENGTH: 22 amino acids 

I QDRAGRMGGRHPSWPWTRCLR 

15 10 15 20 



SEQUENCE ID NO: 10 

SEQUENCE LENGTH: 22 amino acids 

ASGCVHQEAERVS QAHRGRTGQ 

15 10 15 20 



SEQUENCE ID NO: 11 

SEQUENCE LENGTH: 22 amino acids 

GGTRAG PGPGASGCVHQEAERV 

15 10 15 20 
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SEQUENCE ID NO: 12 

SEQUENCE LENGTH: 22 amino acids 
IQDRAGRMGGGGTRAGPGPGAS 
15 10 15 20 



SEQUENCE ID NO: 13 

SEQUENCE LENGTH: 34 amino acids 

SLVRLSSCVPVALMSAMTTSSSQKNITPAI 
1 5 10 15 20 25 30 

L T C C 



SEQUENCE ID NO: 14 

SEQUENCE LENGTH: 4 4 amino acids 

SPKC IMKEKKSLVRLSSCVPVALMSAMTTS 
1 5 10 15 20 25 30 

SSQKNITPAILTCC 
35 40 

SEQUENCE ID NO: 15 

SEQUENCE LENGTH: 19 amino acids 
PKC IMKEKKKSLVRLSSCV 
1 5 10 15 



SEQUENCE ID NO: 16 

SEQUENCE LENGTH: 23 amino acids 

ALMSAMTTSSSQKNITPAILTCC 
15 10 15 20 

SEQUENCE ID NO: 17 

SEQUENCE LENGTH: 23 amino acids 

SLVRLSSCVPVALMSAMTTSSSQ 
15 10 15 20 
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SEQUENCE ID NO: 18 

SEQUENCE LENGTH: 22 amino acids 

S PKCIMKEKKSLVRLSSCVPVA 

15 10 15 20 



SEQUENCE ID NO: 19 
SEQUENCE LENGTH: 12 amino acids 
S P K C I M K E K K A W 
15 10 

SEQUENCE ID NO: 20 

SEQUENCE LENGTH: 12 amino acids 

PKCIMKEKKKAW 

1 5 10 



SEQUENCE ID NO: 21 

SEQUENCE LENGTH: 19 amino acids 

AMTTSSSQKNITPAILTCC 

1 5 10 15 



SEQUENCE ID NO: 22 

SEQUENCE LENGTH: 9 amino acids 

TVGRPHISC 

1 5 



SEQUENCE ID NO: 23 

SEQUENCE LENGTH: 10 amino acids 

KTVGRPHISC 

15 10 



SEQUENCE ID NO: 24 

SEQUENCE LENGTH: 18 amino acids 
KQWEDPTSPANVIALLQT 
15 10 15 
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SEQUENCE ID NO: 25 

SEQUENCE LENGTH: 17 amino acids 

QWEDPTSPANVIALLQT 

15 10 15 

SEQUENCE ID NO: 26 

SEQUENCE LENGTH: 19 amino acids 
QKTIKSTRKKTVGRPHISC 
15 10 15 

SEQUENCE ID NO: 27 

SEQUENCE LENGTH: 20 amino acids 
QKTIKSTRKKKTVGRPHISC 
15 10 15 20 

SEQUENCE ID NO: 28 

SEQUENCE LENGTH: 28 amino acids 

QKTIKSTRKKKQWEDPTSPANVIALLQT 
1 5 10 15 20 25 

SEQUENCE ID NO: 29 

SEQUENCE LENGTH: 27 amino acids 

QKTIKS TRKKQWEDPTSPANVIALLQT 
1 5 10 15 20 25 

SEQUENCE ID NO: 30 

SEQUENCE LENGTH: 34 amino acids 

AADLQQQFVHFLDCWDVSS IPFTLHLPQAQ 
1 5 10 15 20 25 30 

D I T T 

SEQUENCE ID NO: 31 

SEQUENCE LENGTH: 9 amino acids 

GKDAKEKSS 

1 5 
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SEQUENCE ID NO: 32 

SEQUENCE LENGTH: 10 amino acids 
GKDAKEKKSS 
15 10 



SEQUENCE ID NO: 33 

SEQUENCE LENGTH: 42 amino acids 

GKDAKEKKAADLQQQFVH FLDCWDVS S I PF 
1 5 10 15 20 25 30 

TLHLPQAQD I TT 
35 40 



SEQUENCE ID NO: 34 

SEQUENCE LENGTH: 41 amino acids 

GKDAKEKAADLQQQFVHFLDCWDVSS IPFT 
1 5 10 15 20 25 30 

LHLPQAQDI TT 
35 40 



SEQUENCE ID NO: 35 

SEQUENCE LENGTH: 9 amino acids 
FSMKQTLMNVKNLKTK 
15 10 15 



SEQUENCE ID NO: 36 

SEQUENCE LENGTH: 17 amino acids 

KFSMKQTLMNVKNLKTK 

1 5 10 .15 



SEQUENCE ID NO: 37 

SEQUENCE LENGTH: 25 amino acids 

VRTSKTRKKFSMKQTLMNVKNLKTK 
1 5 10 15 20 25 
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SEQUENCE ID NO: 38 

SEQUENCE LENGTH: 2 6 amino acids 

VRTSKTRKKKFSMKQTLMNVKNLKTK 
1 5 10 15 20 25 

SEQUENCE ID NO: 39 

SEQUENCE LENGTH: 12 amino acids 
VRTSKTRKKNFP 
15 10 

SEQUENCE ID NO: 40 

SEQUENCE LENGTH: 11 amino acids 

VRTSKTRKNFP 

1 5 10 

SEQUENCE ID NO: 41 

SEQUENCE LENGTH: 10 amino acids 
IKKK. LLQFQK 
15 10 

SEQUENCE ID NO: 42 

SEQUENCE LENGTH: 11 amino acids 
KIKKKLLQFQK 
15 10 

SEQUENCE ID NO: 43 

SEQUENCE LENGTH: 11 amino acids 
KSRRNYFNFKNNCQSRL 
15 10 15 

SEQUENCE ID NO: 44 

SEQUENCE LENGTH: 16 amino acids 
SRRNYFNFKNNCQSRL 
1 5 10 15 
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SEQUENCE ID NO: 45 

SEQUENCE LENGTH: 18 amino acids 
TNLRVI QKIKKKLLQFQK 
15 10 15 



SEQUENCE ID NO: 46 

SEQUENCE LENGTH: 19 amino acids 
TNLRVI QKKIKKKLLQFQK 
15 10 15 



SEQUENCE ID NO: 47 

SEQUENCE LENGTH: 25 amino acids 

TNLRVI QKKSRRNYFNFKNNCQSRL 
15 10 15 20 25 

SEQUENCE ID NO: 48 

SEQUENCE LENGTH: 24 amino acids 

TNLRVI QKSRRNYFNFKNNCQSRL 
15 10 15 20 



SEQUENCE ID NO: 4 9 

SEQUENCE LENGTH: 5 amino acids 
K I M I T 
1 5 



SEQUENCE ID NO: 50 

SEQUENCE LENGTH: 12 amino acids 
NIDKI PEKIMIT 
15 10 



SEQUENCE ID NO: 51 

SEQUENCE LENGTH: 13 amino acids 

NIDKIPEKKIMIT 

1 5 10 
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SEQUENCE ID NO: 52 

SEQUENCE LENGTH: 5 amino acids 
I I N A N 
1 5 

SEQUENCE ID NO: 53 

SEQUENCE LENGTH: 6 amino acids 
K I I N A N 
1 5 

SEQUENCE ID NO: 54 

SEQUENCE LENGTH: 13 amino acids 
NDKTVSEKI INAN 
15 10 

SEQUENCE ID NO: 55 

SEQUENCE LENGTH: 14 amino acids 
NDKTVSEKKI INAN 
1 5 10 

SEQUENCE ID NO: 56 

SEQUENCE LENGTH: 14 amino acids 

NGLEKEYLMVNQKE 

15 10 

SEQUENCE ID NO: 57 

SEQUENCE LENGTH: 23 amino acids 

SQTSLLEAKNGLEKEYLMVNQKE 
15 10 15 20 

SEQUENCE ID NO: 58 

SEQUENCE LENGTH: 24 amino acids 

SQTSLLEAKKNGLEKEY L M V N Q K E 
15 10 15 20 
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SEQUENCE ID NO: 59 

SEQUENCE LENGTH: 12 amino acids 
SQTSLLEAKKMA 
15 10 

SEQUENCE ID NO: 60 

SEQUENCE LENGTH: 11 amino acids 
SQTSLLEAKMA 
15 10 

SEQUENCE ID NO: 61 

SEQUENCE LENGTH: 6 amino acids 
T L V F P K 
1 5 

SEQUENCE ID NO: 62 

SEQUENCE LENGTH: 7 amino acids 
K T L V F P K 
1 5 

SEQUENCE ID NO: 63 

SEQUENCE LENGTH: 14 amino acids 

LKNVEDQKTLVFPK 

15 10 

SEQUENCE ID NO: 64 

SEQUENCE LENGTH: 15 amino acids 
LKNVEDQKKTLVFPK 
15 10 15 

SEQUENCE ID NO: 65 

SEQUENCE LENGTH: 10 amino acids 

LKNVEDQKKH 

15 10 
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SEQUENCE ID NO: 66 

SEQUENCE LENGTH: 9 amino acids 

LKNVEDQKH 

1 5 

SEQUENCE ID NO: 67 

SEQUENCE LENGTH: 6 amino acids 
K K I Q L Y 
1 5 

SEQUENCE ID NO: 68 

SEQUENCE LENGTH: 7 amino acids 
K K K I Q L Y 
1 5 

SEQUENCE ID NO: 69 

SEQUENCE LENGTH: 36 amino acids 

RKRFSYTEYLAS I IRFIFSVNRRKEIQNLS 
1 5 10 15 20 25 30 

S C N F K I 
35 

SEQUENCE ID NO: 70 

SEQUENCE LENGTH: 15 amino acids 
LRIVSYSKKKKIQLY 
15 10 15 

SEQUENCE ID NO: 71 

SEQUENCE LENGTH: 16 amino . acids 
LRIVSYSKKKKKIQLY 
1 „ 5 10 15 
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SEQUENCE ID NO: 72 

SEQUENCE LENGTH: 4 5 amino acids 
LRIVSYSKKRKRFSYT 
15 10 15 

RRKEIQNLSSCNFKI 
35 40 45 
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EYLAS I IRFI FSVN 
20 25 30 



SEQUENCE ID NO: 73 

SEQUENCE LENGTH: 44 amino acids 

LRIVSYSKRKRFSYTEYLAS I IRFI FSVNR 
1 5 10 15 20 25 

30 

RKEIQNLSSCNFKI 
35 40 

SEQUENCE ID NO: 74 

SEQUENCE LENGTH: 18 amino acids 
QDLPLSSICQTIVTIYWQ 
1 5 10 15 

SEQUENCE ID NO: 75 

SEQUENCE LENGTH: 19 amino acids 
KQDLPLSS I CQTIVTIYWQ 
15 10 15 

SEQUENCE ID NO: 7 6 

SEQUENCE LENGTH: 25 amino acids 

NRTCP FRL FVRRMLQFTGNKVLDRP 
15 10 15 20 25 

a. 

SEQUENCE ID NO: 77 

SEQUENCE LENGTH: 27 amino acids 

GFVVSVVKKQDLPLSSICQTIVTIYWQ 
1 5 10 15 20 25 
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SEQUENCE ID NO: 78 

SEQUENCE LENGTH: 28 amino acids 

GFVVSVVKKKQDLPLSSICQTIVTIYWQ 
15 10 15 20 25 

SEQUENCE ID NO: 79 

SEQUENCE LENGTH: 34 amino acids 

GFVVSVVKKNRTCP.FRLFV.RRMLQFTGNKV 
1 5 10 15 20 25 30 

L D R P 

SEQUENCE ID NO: 80 

SEQUENCE LENGTH: 33 amino acids 

GFVVSVVKNRTCPFRLFVRRMLQFTGNKVL 
1 5 10 15 20 25 30 

D R P 

SEQUENCE ID NO: 81 

SEQUENCE LENGTH: 8 amino acids 

YRKTKNQN 

1 5 

SEQUENCE ID NO: 82 

SEQUENCE LENGTH: 9 amino acids 

KYRKTKNQN 

1 5 

SEQUENCE ID NO: 83 

SEQUENCE LENGTH: 10 amino acids 

NTERPKIRTN 

15 10 

SEQUENCE ID NO: 84 

SEQUENCE LENGTH: 17 amino acids 
DETFYKGKKYRKTKNQN 
15 10 15 
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SEQUENCE ID NO: 85 

SEQUENCE LENGTH: 18 amino acids 
DET FYKGKKKYRKTKNQN 
15 10 15 

SEQUENCE ID NO: 86 

SEQUENCE LENGTH: 19 amino acids 
DETFYKGKKNTERPKIRTN 
15 10 15 

SEQUENCE ID NO: 87 

SEQUENCE LENGTH: 18 amino acids 
DETFYKGKNTERPKIRTN 
15 10 15 

SEQUENCE ID NO: 88 

SEQUENCE LENGTH: 28 amino acids 

LS INNYRFQMKFYFRFTSHGS P FTSANF 
1 5 10 15 20 25 

SEQUENCE ID NO: 89 

SEQUENCE LENGTH: 29 amino acids 

KLS INNYRFQMKFYFRFTSHGS PFTSANF 
1 5 10 15 20 25 

SEQUENCE ID NO: 90 

SEQUENCE LENGTH: 10 amino acids 

NSVSTTTGFR 

15 10 

SEQUENCE ID NO: 91 

SEQUENCE LENGTH: 37 amino acids 

N I QLAATKKLS INNYRFQMKFY FRFTSHGS 
1 5 10 15 20 25 30 

PFTSANF 
35 
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SEQUENCE ID NO: 92 

SEQUENCE LENGTH: 38 amino acids 

NIQLAATKKKLS INNYRFQMKFYFRFTSHG 
1 5 10 15 20 25 30 

SPFTSANF 
35 

SEQUENCE ID NO : 93 

SEQUENCE LENGTH: 19 amino acids 

NIQLAATKKNSVSTTTGFR 

15 10 15 

SEQUENCE ID NO: 94 

SEQUENCE LENGTH: 18 amino acids 

NIQLAATKNSVSTTTGFR 

15 10 15 

SEQUENCE ID NO: 95 

SEQUENCE LENGTH: 18 amino acids 
MEHVAPGRMSASPQSPTQ 
15 10 15 

SEQUENCE ID NO: 96 

SEQUENCE LENGTH: 19 amino acids 
KMEHVAPGRMSASPQS PTQ 
15 10 15 

SEQUENCE ID NO: 97 

SEQUENCE LENGTH: 59 amino acids 

KWSTWLQAECQHLHSPQRSDKPQ Q A G L D Q Q 
1 5 10 15 20 25 30 

HHCFALDSSPGPRPVFLQLLGLMGQGRHD 
35 40 45 50 55 
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SEQUENCE ID NO: 98 

SEQUENCE LENGTH: 58 amino acids 

WSTWLQAECQHLHS PQRSDKPQQAGLDQQH 
1 5 10 15 20 25 30 

HCFALDSSPGPRPVFLQLLGLMGQGRHD 
35 40 45 50 55 



SEQUENCE ID NO: 99 

SEQUENCE LENGTH: 2 6 amino acids 

T FSVWAEKME HVAPGRMSAS PQS PTQ 
1 5 10 15 20 25 



SEQUENCE ID NO: 100 

SEQUENCE LENGTH: 27 amino acids 

TFSVWAEKKMEHVAPGRMSASPQSPTQ 
1 5 10 15 20 25 



SEQUENCE ID NO: 101 

SEQUENCE LENGTH: 67 amino acids 

TFSVWAEKKWSTWLQAECQHLHSPQRSDKP 
1 5 10 15 20 25 30 

QQAGLDQQHHCFALDSSPGPRPVFLQLLGL 

35 40 45 50 55 60 

M G Q G R H D 

65 



SEQUENCE ID NO: 102 

SEQUENCE LENGTH: 66 amino acids 
TFSVWAEKWSTWLQAE 
15 10 15 

QAGLDQQHHC FALDSS 

35 40 45 

G Q G R H D 

65 



CQHLHSPQRSDKPQ 
20 25 30 

PGPRPVFLQLLGLM 
50 55 60 
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SEQUENCE ID NO: 103 
SEQUENCE LENGTH: 18 amino acids 
HKWLKFCLLRLVKES FHE 
15 10 15 

SEQUENCE ID NO: 104 

SEQUENCE LENGTH: 19 amino acids 
KHKWLKFCLLRLVKESFHE 
15 10 15 

SEQUENCE ID NO: 105 

SEQUENCE LENGTH: 27 amino acids 

KGGKAKGKKHKWLKFCLLRLVKES FHE 
1 5 10 15 20 25 

SEQUENCE ID NO: 106 

SEQUENCE LENGTH: 2 8 amino acids 

KGGKAKGKKKHKWLKFCLLRLVKES FHE 
1 5 10 15 20 25 

SEQUENCE ID NO: 107 

SEQUENCE LENGTH: 13 amino acids 
KGGKAKGKKNTNG 
15 10 

SEQUENCE ID NO: 108 

SEQUENCE LENGTH: 12 amino acids 

KGGKAKGKNTNG 

1 5 10 

SEQUENCE ID NO: 109 

SEQUENCE LENGTH: 8 amino acids 

VNNFFKKL 

1 5 
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SEQUENCE ID NO: 110 

SEQUENCE LENGTH: 9 amino acids 
KVNNFFKKL 
1 5 

SEQUENCE ID NO: 111 

SEQUENCE LENGTH: 16 amino acids 
LSQGNVKKVNNFFKKL 
15 10 15 

SEQUENCE ID NO: 112 

SEQUENCE LENGTH: 17 amino acids 
LSQGNVKKKVNNFFKKL 
15 10 15 

SEQUENCE ID NO: 113 
SEQUENCE LENGTH: 38 amino acids 
GEKNDLQLFVMSDRRYKIYWTVILLNPCGN 
1 5 10 15 20 25 30 

LHLKTTSL 
35 

SEQUENCE ID NO: 114 

SEQUENCE LENGTH: 39 amino acids 

KGEKNDLQLFVMSDRRYKIYWTVI LLNPCG 
1 5 10 15 20 25 30 

NLHLKTTSL 
35 

SEQUENCE ID NO: 115 

SEQUENCE LENGTH: 10 amino acids 

KGKKMICSYS 

15 10 
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SEQUENCE ID NO: 116 

SEQUENCE LENGTH: 9 amino acids 

GKKMICSYS 

1 5 
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SEQUENCE ID NO: 117 

SEQUENCE LENGTH: 4 6 amino acids 

SSKTFEKKGEKNDLQLFVMSDRRYKIYWTV 
1 5 10 15 20 25 30 

ILLNPCGNLHLKTTSL 
35 40 45 

SEQUENCE ID NO: 118 

SEQUENCE LENGTH: 4 7 amino acids 

SSKTFEKKKGEKNDLQLFVMS DRRYKI YWT 
1 5 10 15 20 25 30 

VI LLNPCGNLHLKTTSL 
35 30 45 

SEQUENCE ID NO: 119 

SEQUENCE LENGTH: 18 amino acids 
SSKTFEKKKGKKMICSYS 
1 5 10 15 

SEQUENCE ID NO: 120 

SEQUENCE LENGTH: 17 amino acids 
SSKTFEKKGKKMICSYS 
15 10 15 

SEQUENCE ID NO: 121 

SEQUENCE LENGTH: 17 amino acids 
QRKPKRA-NCVI QRRAKM 
15 10 15 
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SEQUENCE ID NO: 122 

SEQUENCE LENGTH: 18 amino acids 
KQRKPKRANCVI QRRAKM 
15 10 15 



SEQUENCE ID NO: 123 

SEQUENCE LENGTH: 26 amino acids 

NKENQKEQTALLYRGGQRCRCVCLRF 
1 5 10 15 20 25 



SEQUENCE ID NO: 124 

SEQUENCE LENGTH: 2 6 amino acids 

PDYQPPAKKQRKPKRANCVIQRRAKM 
15 10 15 20 25 



SEQUENCE ID NO: 125 

SEQUENCE LENGTH: 27 amino acids 

PDYQPPAKKKQRKPKRANCVI QRRAKM 
15 10 15 20 25 



SEQUENCE ID NO: 126 

SEQUENCE LENGTH: 35 amino acids 

PDYQPPAKKNKENQKEQTALLYRGGQRCRC 
1 5 10 15 20 25 30 

V C L R F 
35 



SEQUENCE ID NO: 127 

SEQUENCE LENGTH: 34 amino acids 

PDYQPPAKNKENQKEQTALLYRGGQRCRCV 
1 5 10 15 20 25 30 

C L R F 



BNSDOCID: <WO. . 9958S52A2. 1 . > 
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SEQUENCE ID NO: 128 

SEQUENCE LENGTH: 7 amino acids 

N L S S L L I 

1 5 

SEQUENCE ID NO: 129 

SEQUENCE LENGTH: 5 amino acids 
T C L P F 
1 5 

SEQUENCE ID NO: 130 

SEQUENCE LENGTH: 15 amino acids 
QPTFTLRKNLSSLLI 
15 10 15 

SEQUENCE ID NO: 131 

SEQUENCE LENGTH: 16 amino acids 
QPTFTLRKKNLSSLLI 
15 10 15 

SEQUENCE ID NO: 132 

SEQUENCE LENGTH: 14 amino acids 
QPTFTLRKKTCLPF 
1 5 10 

SEQUENCE ID NO: 133 

SEQUENCE LENGTH: 13 amino acids 
QPT.FT LRKT C L P F 
15 10 

SEQUENCE ID NO: 134 

SEQUENCE LENGTH: 31 amino acids 

RATFLLSLWECSLPQARLCLIVSRTGLLVQ 
1 5 10 15 20 25 30 

S 



9958552A2 I > 
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SEQUENCE ID NO: 135 

SEQUENCE LENGTH: 19 amino acids 
GQHFYWHCGSAACHRRGCV 
15 10 15 



SEQUENCE ID NO: 136 

SEQUENCE LENGTH: 39 amino acids 

KENVRDKKRATFLLSLWECSLPQARLCLIV 
1 5 10 15 20 25 30 

SRTGLLVQS 
35 



SEQUENCE ID NO: 137 

SEQUENCE LENGTH: 40 amino acids 

KENVRDKKKRATFLLSLWECSLPQARLCLI 
1 5 10 15 20 25 30 

VSRTGLLVQS 
35 40 



SEQUENCE ID NO: 138 

SEQUENCE LENGTH: 28 amino acids 

KENVRDKKKGQHFYWHCGSAACHRRGCV 
15 10 15 20 25 



SEQUENCE ID NO: 139 

SEQUENCE LENGTH: 27 amino acids 

KENVRDKKGQHFYWHCGSAACHRRGCV 
15 10 15 20 25 



SEQUENCE ID NO: 140 

SEQUENCE LENGTH: 3 9 amino acids 

ITHTRWGI TTWDSWSVRMKANWIQAQQNKS 
1 5 10 15 20 25 30 

LILSPSFTK 
35 



BNSDOCID: <WO 9958552A2_L> 
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SEQUENCE ID NO: 141 

SEQUENCE LENGTH: 4 0 amino acids 

KITHTRWGITTWDSWSVRMKANWIQAQQNK 
1 5 10 15 20 25 30 

SLILSPSFTK 
35 40 



SEQUENCE ID NO: 142 

SEQUENCE LENGTH: 16 amino acids 
KLLTPGGELPHGILGQ 
15 10 15 



SEQUENCE ID NO: 143 

SEQUENCE LENGTH: 15 amino acids 
LLTPGGELPHGILGQ 
15 10 15 



SEQUENCE ID NO: 14 4 

SEQUENCE LENGTH: 4 7 amino acids 

PPVCELEKITHTRWGITTWDSWSV R M K A N W 
1 5 10 15 20 25 30 

IQAQQNKSLILSPSFTK 
35 40 45 



SEQUENCE ID NO: 145 

SEQUENCE LENGTH: 4 8 amino acids 

PPVCELEKK I THTRWG I TTWDSWSVRMKAN 
1 5 10 15 20 25 30 

WIQAQQNKSLILSPSFTK 
35 40 45 



SEQUENCE ID NO: 14 6 

SEQUENCE LENGTH: 2 4 amino acids 

PPVCELEKKLLTPGGELPHGILGQ 
15 10 15 20 



BNSDOCID: <WO 9958552A2. 1_> 



• m 
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SEQUENCE ID NO: 147 
SEQUENCE LENGTH: 23 amino acids 

PPVCELEKLLTPGGELPHGILGQ 
15 10 15 20 

SEQUENCE ID NO: 148 

SEQUENCE LENGTH: 11 amino acids 
SLKDELEKMKI 
15 10 

SEQUENCE ID NO: 149 

SEQUENCE LENGTH: 12 amino acids 
SLKDELEKKMKI 
15 10 

SEQUENCE ID NO: 150 
SEQUENCE LENGTH: 12 amino acids 
LGQSS PEKKNKN 
15 10 

SEQUENCE ID NO: 151 

SEQUENCE LENGTH: 11 amino acids 
LGQSS PEKNKN 
15 10 

SEQUENCE ID NO: 152 

SEQUENCE LENGTH: 23 amino acids 

R L R R T ^ GRGSQIRSRNAFNRSEE 
15 10 15 20 

SEQUENCE ID NO: 153 

SEQUENCE LENGTH: 10 amino acids 
EPKVKEEKKT 
15 10 



BNSDOCID: <WO . 9958552A2 I > 
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SEQUENCE ID NO: 154 

SEQUENCE LENGTH: 11 amino acids 
EPKVKEEKKKT 
15 10 

SEQUENCE ID NO: 155 

SEQUENCE LENGTH: 32 amino acids 

EPKVKEEKKRLRRINGRGSQIRSRNAFNRS 
1 5 10 15 20 25 30 

E E 

SEQUENCE ID NO: 156 

SEQUENCE LENGTH: 31 amino acids 

EPKVKEEKRLRRINGRGSQIRSRNAFNRSE 
1 5 10 15 20 25 30 

E 

SEQUENCE ID NO: 157 

SEQUENCE LENGTH: 14 amino acids 
TFRYKGKQHPFFST 
15 10 

SEQUENCE ID NO: 158 

SEQUENCE LENGTH: 10 amino acids 

GPNAPEEKNH 

1 5 10 

SEQUENCE ID NO: 159 y * c 

SEQUENCE LENGTH: 11 amino acids 
GPNAPEEKKNH 
1 5 10 

SEQUENCE ID NO: 160 

SEQUENCE LENGTH: 23 amino acids 

GPNAPEEKKTFRYKGKQHPFFST 
15 10 15 20 



BNSDOCID: <WO 9958552A2 I > 
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SEQUENCE ID NO: 161 

SEQUENCE LENGTH: 22 amino acids 
GPNAPEEKTFRYKGKQHPFFST 
15 10 15 20 

SEQUENCE ID NO: 162 

SEQUENCE LENGTH: 6 amino acids 
M Q N T C V 
1 5 

SEQUENCE ID NO: 163 

SEQUENCE LENGTH: 1 amino acids 
K M Q N T C V 
1 5 

SEQUENCE ID NO: 164 

SEQUENCE LENGTH: 9 amino acids 

KCKIRVFSK 

1 5 

SEQUENCE ID NO: 165 

SEQUENCE LENGTH: 8 amino acids 

CKIRVFSK 

1 5 

SEQUENCE ID NO: 166 

SEQUENCE LEIT I: 14 amino acids 
FFKRTVQKMQNTCV 
15 10 

SEQUENCE ID NO: 167 

SEQUENCE LENGTH: 15 amino acids 
FFKRTVQKKMQNTCV 
15 10 15 



BNSDOCID: <WO_9958552A2.l_> 
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SEQUENCE ID NO: 168 

SEQUENCE LENGTH: 17 amino acids 
FFKRTVQKKCKIRVFSK 
15 10 15 

SEQUENCE ID NO: 169 

SEQUENCE LENGTH: 16 amino acids 
FFKRTVQKCKIRVFSK 
15 10 15 

SEQUENCE ID NO: 170 

SEQUENCE LENGTH: 7 amino acids 
L P H Y L A H 
1 5 

SEQUENCE ID NO: 171 

SEQUENCE LENGTH: 8 amino acids 

CLITWLTN 

1 5 

SEQUENCE ID NO: 172 

SEQUENCE LENGTH: 17 amino acids 
GSTTGLSATPLPHYLAH 
15 10 15 

SEQUENCE ID NO: 173 

SEQUENCE LENGTH: 118 amino acids 
GSTTGLSATPPLPHYLAH 
15 10 15 

SEQUENCE ID NO: 174 

SEQUENCE LENGTH: 19 amino acids 
GSTTGLSATPPCLITWLTN 
15 10 15 



BNSDOCID: <WO 9958S52A2 I > 
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SEQUENCE ID NO: 175 

SEQUENCE LENGTH: 18 amino acids 
GSTTGLSATPCLI TWLTN 
15 10 15 

SEQUENCE ID NO: 176 

SEQUENCE LENGTH: 9 amino acids 

RFADKPRPN 

1 5 

SEQUENCE ID NO: 177 

SEQUENCE LENGTH: 20 amino acids 
DLPTSPDQTRSGPVHVSVEP 
15 10 15 20 

SEQUENCE ID NO: 178 

SEQUENCE LENGTH: 19 amino acids 
DSAAGCSGTPRFADKPRPN 
15 10 15 

SEQUENCE ID NO: 179 

SEQUENCE LENGTH: 20 amino acids 
DSAAGCSGTPPRFADKPRPN 
15 10 15 20 

SEQUENCE ID NO: 180 

SEQUENCE LENGTH: 31 amino acids 

DSAAGCS^^PPDLPTSPDQTRSGPVHVSVE 
1 5 10 15 20 25 30 

P 

SEQUENCE ID NO: 181 

SEQUENCE LENGTH: 30 amino acids 

DSAAGCSGTPDLPTSPDQTRSGPVHVSVEP 
1 5 10 15 20 25 30 



BNSDOCID: <WO... 9958552A2 I > 



WO 99/58552 
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SEQUENCE ID NO: 182 

SEQUENCE LENGTH: 53 amino acids 

AHPETPAQNRLRIPCSRREVRSRACKPPGA 
1 5 10 15 20 25 30 

QGSDERRGKASPGRDCDVRTGRP 
35 40 45 50 

SEQUENCE ID NO: 183 

SEQUENCE LENGTH: 54 amino acids 

PAHPETPAQNRLRI PCSRREVRSRACKP PG 
1 5 10 15 20 25 30 

AQGS DERRGKASPGRDCDVRTGRP 
35 40 45 50 



SEQUENCE ID NO: 184 

SEQUENCE LENGTH: 2 0 amino acids 
RPTRRHPRRIASGS PAVGGR 
15 10 15 20 



SEQUENCE ID NO: 185 

SEQUENCE LENGTH: 63 amino acids 

VAIRGHPRPPAHPET PAQNRLRI PCSRREV 
1 5 10 15 20 25 30 

RSRACKPPGAQGSDERRGKASPGRDCDVRT 
35 40 45 50 55 60 

G R P 



SEQUENCE ID NO: 186 r „ 

SEQUENCE LENGTH: 64 amino acids 

VAIRGHPRPPPAHPETPAQNRLRI PCSRRE 
1 5 10 15 20 25 30 

VRSRACKPPGAQGSDERRGKASPGRDCDVR 
35 40 45 50 55 60 

T G R P 



BNSDOCID: <WO. . 9958552A2 I > 
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SEQUENCE ID NO: 187 

SEQUENCE LENGTH: 30 amino acids 

VAIRGHPRPPRPTRRHPRRIASGSPAVGGR 
1 5 10 15 20 25 30 



SEQUENCE ID NO: 188 

SEQUENCE LENGTH: 29 amino acids 

VAIRGHPRPRPTRRHPRRIASGSPAVGGR 
15 10 15 20 25 



SEQUENCE ID NO: 189 

SEQUENCE LENGTH: 85 amino acids 

RGRTSGRSLSCCRRPRCRPAVASRS TAPSP 
1 5 10 15 20 25 30 

RAGSRRCCLRTSCGAARPRRTRSACGDWVA 

35 40 45 50 55 60 

SPPTRSSSRTACGAASPPARSWSAP 

65 70 75 80 85 



SEQUENCE ID NO: 190 

SEQUENCE LENGTH: 8 amino acids 

GGGHLEEV 

1 5 



SEQUENCE ID NO: 191 

SEQUENCE LENGTH: 94 amino acids 
YFGGPDSTPRGRTSGR 
15 ?T 15 

ASRSTAPSPRAGSRRC 

35 40 45 

RSACGDWVAS PPTRSS 

65 70 75 

W S A P 



SLSCCR1. PRCRPAV 
20 25 30 

CLRTSCGAARPRRT 
50 55 60 

SRTACGAAS PPARS 
80 85 90 



BNSDOCID. <WO_ . 9958552A2J_> 
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SEQUENCE ID NO: 192 

SEQUENCE LENGTH: 95 amino acids 

YFGGPDSTPPRGRTSGRSLSCCRRPRCRPA 
1 5 10 15 20 25 30 

VASRSTAPSPRAGSRRCCLRTSCGAARPRR 

35 40 45 50 55 60 

TRSACGDWVASPPTRSSSRTACGAASPPAR 

65 70 75 80 85 90 

S W S A P 

95 



SEQUENCE ID NO: 193 

SEQUENCE LENGTH: 18 amino acids 
YFGGPDSTPPGGGHLEEV 
15 10 15 

SEQUENCE ID NO: 194 

SEQUENCE LENGTH: 17 amino acids 
YFGGPDSTPGGGHLEEV 
15 10 15 



SEQUENCE ID NO: 195 

SEQUENCE LENGTH: 6 amino acids 
H R V A D P 
1 5 



SEQUENCE I'D NO: 196 

SEQUENCE LENGTH: 13 amino acids ui. 
LSQSSELDPPSSR 
15 10 



SEQUENCE ID NO: 197 

SEQUENCE LENGTH: 14 amino acids 
LSQSSELDPPPSSR 
15 10 



BNSDOCID: <WO . 9958552A2 l_> 
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SEQUENCE ID NO: 198 

SEQUENCE LENGTH: 16 amino acids 
LSQSSELDPPHRVADP 
15 10 15 

SEQUENCE ID NO: 199 

SEQUENCE LENGTH: 15 amino acids 
LSQSSELDPHRVADP 
15 10 15 

SEQUENCE ID NO: 200 

SEQUENCE LENGTH: 11 amino acids 

VILLPEDTPPS 

15 10 

SEQUENCE ID NO: 201 

SEQUENCE LENGTH: 12 amino acids 

VILLPEDTPPPS 

15 10 

SEQUENCE ID NO: 202 

SEQUENCE LENGTH: 14 amino acids 
VI LLPEDTPPLLRA 
15 10 

SEQUENCE ID NO: 203 

SEQUENCE LENGTH: 13 amino acids 
VI LLPELDPLL^A 
15 10 

SEQUENCE ID NO: 204 

SEQUENCE LENGTH: 5 amino acids 
P S P L P 
1 5 



BNSDOCID: <WO 9958552A2 . 1 _> 



WO 99/58552 




PCT/NO99/00143 



33 



SEQUENCE ID NO: 205 

SEQUENCE LENGTH: 25 amino acids 

PLLFHRPCSPSPALGATVLAVYRYE 
1 5 10 15 20 25 

SEQUENCE ID NO: 206 

SEQUENCE LENGTH: 24 amino acids 

LLFHRPCS PS PALGATVLAVYRYE 
1 5 10 15 20 

SEQUENCE ID NO: 207 

SEQUENCE LENGTH: 13 amino acids 
APRPPLGPPSPLP 
15 10 

SEQUENCE ID NO: 208 

SEQUENCE LENGTH: 14 amino acids 

APRPPLGPPPSPLP 

15 10 

SEQUENCE ID NO: 209 

SEQUENCE LENGTH: 34 amino acids 

APRPPLGPPPLLFHRPCSPSPALGATVLAV 
1 5 10 15 20 25 30 

Y R Y E 



SEQUENCE ID NO: 210 * , 

SEQUENCE LENGTH: 33 amino acids 

APRPPLGPPLLFHRPCSPSPALGATVLAVY 
1 5 10 15 20 25 30 

RYE 



BNSDOCID: <WO 99S8552A2_L> 
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SEQUENCE ID NO: 211 

SEQUENCE LENGTH: 28 amino acids 

TQVLPQGCSLSLLHTTFPHRQVPHILDW 
15 10 15 20 25 



SEQUENCE ID NO: 212 

SEQUENCE LENGTH: 29 amino acids 

PTQVLPQGCSLSLLHTTFPHRQVPHILDW 
15 10 15 20 25 



SEQUENCE ID NO: 213 

SEQUENCE LENGTH: 54 amino acids 

PLQSFPKDAASAFSTPRFPTDKFPTSWTGS 
1 5 10 15 20 25 30 

CPGQPHGTRAFCQPGPEFNAFSAC 
35 40 45 50 

SEQUENCE ID NO: 214 

SEQUENCE LENGTH: 53 amino acids 

LQS FPKDAASAFS TPRFPTDKFPTSWTGSC 
1 5 10 15 20 25 30 

PGQPHGTRAFCQPGPEFNAFSAC 
35 40 45 50 

SEQUENCE ID NO: 215 

SEQUENCE LENGTH: 38 amino acids 

PSPRPQSQPPTQVLPQGCSLSLLHTTFPHR 
1 5 10 15 20 25 30 

QVPHILDW 
35 



BNSDOCID: <WO .. 9958S52A2 I > 
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SEQUENCE ID NO: 216 

SEQUENCE LENGTH: 39 amino acids 

PSPRPQSQPPPTQVLPQGCSLSLLHTTFPH 
1 5 10 15 20 25 30 

RQVPHILDW 
35 

SEQUENCE ID NO: 217 

SEQUENCE LENGTH: 64 amino acids 

PSPRPQSQPPPLQSFPKDAASAFSTPR 
1 5 10 15 20 25 

DKFPTSWTGSCPGQPHGTRAFCQPGPE 
35 40 45 50 55 

F S A C 



SEQUENCE ID NO: 218 

SEQUENCE LENGTH: 63 amino acids 

PSPRPQSQPPLQSFPKDAASAFSTPRFPTD 

1 5 10 15 20 25 30 

KFPTSWTGSCPGQPHGTRAFCQPGPEFNAF 
35 40 45 50 55 60 

SAC 

SEQUENCE ID NO: 219 

SEQUENCE LENGTH: 30 amino acids 

TAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 220 

SEQUENCE LENGTH: 31 amino acids 

PTAWPGRRRFTTPEPYCLCTPLGPWAPRFLW 
1 5 10 15 20 25 30 



BNSOOC1D:<WO 9958552A2.I > 
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SEQUENCE ID NO: 221 

SEQUENCE LENGTH: 50 amino acids 

PRPGPAGGALLPRSLTAFVPHSGHGLPVSS 
1 5 10 15 20 25 30 

GEPAYTP I PHDVPHGTPPFC 
35 40 45 50 

SEQUENCE ID NO: 222 

SEQUENCE LENGTH: 49 amino acids 

RPGPAGGALLPRSLTAFVPHSGHGLPVSSG 
1 5 10 15 20 25 30 

EPAYTPI PHDVPHGTPPFC 
35 40 45 



SEQUENCE ID NO: 223 

SEQUENCE LENGTH: 39 amino acids 

DLPAVPGPPTAWPGRRRFTTPEPYCLCTPL 
1 5 10 15 20 25 30 

GPWAPRFLW 
35 



SEQUENCE ID NO: 224 

SEQUENCE LENGTH: 40 amino acids 

DLPAVPGPPPTAWPGRRRFTTPEPYCLCTP 
1 5 10 15 20 25 30 

LGPWAPRFLW 
35 40 

SEQUENCE ID NO: 225 

SEQUENCE LENGTH: 59 amino acids 

DLPAVPG PPPRPGPAGGALLPRS LTAFVPH 
1 5 10 15 20 25 30 

SGHGLPVSSGEPAYTPIPHDVPHGTPPFC 
35 40 45 50 55 



BNSDOCID: <WO__ 9958552A2J _> 
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SEQUENCE ID NO: 226 

SEQUENCE LENGTH: 58 amino acids 

DLPAVPGPPRPGPAGGALLPRS LTAFVPHS 
1 5 10 15 20 25 30 

GHGLPVSSGEPAYTPIPHDVPHGTPPFC 
35 40 45 50 55 

SEQUENCE ID NO: 227 

SEQUENCE LENGTH: 8 amino acids 

QWGLSWMS 

1 5 

SEQUENCE ID NO: 228 

SEQUENCE LENGTH: 14 amino acids 

NGDCHGCPEGRQSL 

15 10 

SEQUENCE ID NO: 229 

SEQUENCE LENGTH: 17 amino acids 
FTMDRVLTPQWGLSWMS 
15 10 15 

SEQUENCE ID NO: 230 

SEQUENCE LENGTH: 18 amino acids 
FTMDRVLTPPQWGLSWMS 
1 5 10 15 

SEQUENCE ID NO: 231 

SEQUENCE LENGTH: 24 amino acids 

FTMDRVLTPPNGDCHGCPEGRQSL 
15 10 15 20 

SEQUENCE ID NO: 232 

SEQUENCE LENGTH: 23 amino acids 

FTMDRVLT PNGDCHGCPEGRQSL 
15 10 15 20 



BNSDOCID: <WO 9958552A2J. > 
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SEQUENCE ID NO: 233 

SEQUENCE LENGTH: 115 amino acids 

HHPARQCPHC IMHLQTQLIHRNLTGPSQLT 
1 5 10 15 20 25 30 

SLHRSPYQIAATPWTTDFAASFFLNPVTPF 

35 40 45 50 55 60 

LLCRRCQGKDVLCTNARCLSQTSPSHHKAL 

65 70 75 80 85 90 

SRTTTQCMNTTPWLAVRPAKAFPLL 

95 100 105 110 115 

SEQUENCE ID NO: 234 

SEQUENCE LENGTH: 116 amino acids 

PHHPARQCPHCIMHLQTQLIHRNLTGPSQL 
1 5 10 15 20 25 30 

TSLHRSPYQIAATPWTTDFAAS FFLNPVTP 

35 40 45 50 55 60 

FLLCRRCQGKDVLCTNARCLSQTSPSHHKA 

65 70 75 80 85 90 

LSRTTTQCMNTTPWLAVRPAKAFPLL 

95 100 105 110 115 

SEQUENCE ID NO: 235 

SEQUENCE LENGTH: 23 amino acids 

HTIQHASVPTASCISKLNSYTEN 
15 10 15 20 



BNSDOCID: <WO_._..9958552A2 I > 



WO 99/58552 




PCT/NO99/00143 



SEQUENCE ID NO: 236 

SEQUENCE LENGTH: 12 6 amino acids 

PQVGMRPSNPPHHPARQCPHC IMHLQTQLI 
1 5 10 15 20 25 30 

HRNLTGPSQLTSLHRSPYQIAATPWTTDFA 

35 40 45 50 55 60 

ASF F L NPVTPFLLCRRCQGKDVLCTNARCL 

65 70 75 80 85 90 

SQTSPSHHKALSRTTTQCMNTTPWLAVRPA 

95 100 105 110 115 120 

K A F P L L 

125 

SEQUENCE ID NO: 237 

SEQUENCE LENGTH: 127 amino acids 

PQVGMRPSNPPPHHPARQCPHCIMHLQTQL 
1 5 10 15 20 25 30 

IHRNLTGPSQLTSLHRSPYQIAATPWTTDF 

35 40 45 50 55 60 

AAS FFLNPVTPFLLCRRCQGKDVLCTNARC 

65 70 75 80 85 90 

LSQTSPSHHKALSRTTTQCMNTTPWLAVRP 

95 100 105 110 115 120 

A K A F P L L 

125 

SEQUENCE ID NO: 238 

SEQUENCE LENGTH: 34 amino acids 

PQ'VGMRPSNPPHTIQHASVPTASCISKLNS 
1 5 10 15 20 25 30 

Y T E N 
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SEQUENCE ID NO: 239 

SEQUENCE LENGTH: 33 amino acids 

PQVGMRPSNPHTIQHASVPTASCISKLNSY 
1 5 10 15 20 25 30 

TEN 



SEQUENCE ID NO: 240 

SEQUENCE LENGTH: 51 amino acids 

WAARSWCERRAAAVAPLAPWAWGCPAGCTP 
1 5 10 15 20 25 30 

PVAAR. ACAATRPEGWRS PCTH 
35 40 45 50 



SEQUENCE ID NO: 241 

SEQUENCE LENGTH: 52 amino acids 

PWAARSWCERRAAAVAPLAPWAWGCPAGCT 
1 5 10 15 20 25 30 

PPVAARACAATRPEGWRSPCTH 
35 40 45 50 



SEQUENCE ID NO: 242 

SEQUENCE LENGTH: 74 amino acids 

RGLRGAGARGGLRLLR 

15 10 15 

PPLRLGPALLPAPRGG 

35 40 45 

GAGGDRGHPGPAAL 

65 70 



HLRPGLGDALRGVH 
20 25 30 

EAPAHT DARARRVH 
50 55 60 



SEQUENCE ID NO: 243 

SEQUENCE LENGTH: 61 amino acids 

EEKLARCRP PWAARSWCE RRAAAVAPLAPW 
1 5 10 15 20 25 30 

AWGCPAGCTPPVAARACAATRPEGWRSPCTH 
35 40 45 50 55 60 



BNSDOCID: <WO 99S8552A2 I > 
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SEQUENCE ID NO: 244 

SEQUENCE LENGTH: 62 amino acids 

EEKLARCRPPPWAARSWCERRAAAVAPLAP 
1 5 10 15 20 25 30 

WAWGCPAGCTPPVAARACAATRPEGWRSPCTH 
35 40 45 50 55 60 . 

SEQUENCE ID NO: 245 

SEQUENCE LENGTH: 84 amino acids 

EEKLARCRPPRGLRGAGARGGLRLLRHLRP 
1 5 10 15 20 25 30 

GLGDALRGVHPPLRLGPALLPAPRGGEAPA 

35 40 45 50 55 60 

HTDARARRVHGAGGDRGHPGPAAL 

65 70 75 80 

SEQUENCE ID NO: 246 

SEQUENCE LENGTH: 83 amino acids 

EEKLARCRPRGLRGAGARGGLRLLRHLRPG 
1 5 10 15 20 25 30 

LGDALRGVHPPLRLGPALLPAPRGGEAPAH 

35 40 45 50 55 60 

TDARARRVHGAGGDRGHPG PAAL 

65 70 75 80 



BNSDOCID: <WO_. 9958552A2J. > 



WO 99/58552 |JP PCT/NO99/00143 

42 

SEQUENCE ID NO: 247 

SEQUENCE LENGTH: 163 amino acids 

QPPVS PRPRRPGRPRAPPPPQPMVSPRRRT 
1 5 10 15 20 25 30 

TGPPWRPPPLQSTMSPPPQALHQAQLLLWC 

35 40 45 50 55 60 

TTAPLPGLPQPQPARALHSQFPATTLILLP 

65 70 75 80 85 90 

PLPAIAPRLMPVALTIARYLLSPPPITALL 

95 100 105 110 115 120 

PSCLLGSLSFSCLFTFQTSSLIPLWKIPAP 

125 130 135 140 145 150 

TTTKS CRETFLKW 

155 160 



SEQUENCE ID NO: 248 

SEQUENCE LENGTH: 85 amino acids 

SPGCHLGPGDQAAPGLHRPPSPWCHLGAGQ 
1 5 10 15 20 25 30 

QARLGVHRPSSPQCHLGLRLCIRLS FYSGA 

35 40 45 50 55 60 

QRHLCQGYHNPSQQEHSILNSQPPL 

65 70 75 80 85 



BNSDOCID: <WO 995B552A2J. > 
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SEQUENCE ID NO: 24 9 

SEQUENCE LENGTH: 172 amino acids 

KPAPGSTAPQPPVSPRPRRPGRPRAPPPPQ 
1 5 10 15 20 25 30 

PMVSPRRRTTGPPWRPPPLQS TMSPPPQAL 

35 40 45 50 55 60 

HQAQLLLWC-T TAPLPGLPQPQPARALHS QF 

65 70 75 80 85 90 

PATTLI LLPPLPAIAPRLMPVALTIARYLL 

95 100 105 110 115 120 

SPPPITALLPSCLLGSLSFSCLFTFQTSSL 

125 130 135 140 145 150 

IPLWKI PAPTTTKSCRETFLKW 

155 160 165 170 

SEQUENCE ID NO: 250 

SEQUENCE LENGTH: 173 amino acids 

KPAPGSTAPPQPPVSPRPRRPGRPRAPPPP 
1 5 10 15 20 25 30 

QPMVSPRRRTTGPPWRPPPLQSTMSPPPQA 

35 40 45 50 55 60 

LHQAQLLLWCTTAPLPGLPQPQPARALHSQ 

65 70 75 80 85 90 

FPATTLILLPPLPAIAPRLMPVALTIARYL 

95 100 105 110 115 120 

LSPPPITALLPSCLLGSLSFSCLFTFQTSS 

125 130 135 140 145 150 

LIPLWKIPAPTTTKSCRETFLKW 

155 160 165 170 



BNSDOCID: <WO__.9958S52A2.l..> 
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SEQUENCE ID NO: 251 

SEQUENCE LENGTH: 65 amino acids 

KPAPGSTAPPSPGCHLGPGDQAAPGLHRPP 
1 5 10 15 20 25 30 

S PWCHLGAGQQARLGVHRPS SPQCHLGLRL 

C IRLS FYSGAQRHLCQGYHNPSQQEHS ILN 
35 40 45 50 55 60 

S Q P P L 
65 



SEQUENCE ID NO: 252 

SEQUENCE LENGTH: 94 amino acids 

KPAPGSTAPS PGCHLGPGDQAAPGLHRPPS 
1 5 10 15 20 25 30 

PWCHLGAGQQARLGVHRPSSPQCHLGLRLC 

35 40 45 50 55 60 

IRLSFYSGAQRHLCQGYHNPSQQEHSILNS 

65 70 75 80 85 90 

Q P P L 



SEQUENCE ID NO: 253 

SEQUENCE LENGTH: 113 amino acids 

QPMVSPRRRTTGPPWRPPPLQSTMSPPPQA 
1 5 10 15 20 25 30 

LHQAQLLLWCTTAPLPGLPQPQPARALHSQ 

35 40 45 50 55 60 

FPATTLILLPPLPAIAPRLMPVALTIARYL 

65 70 75 80 85 90 

LSPPPITALLPSCLLGSLSFSCLFTFQTSS 

LI PLWKIPAPTTTKSCRETFLKW 
95 100 105 110 
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SEQUENCE ID NO: 254 

SEQUENCE LENGTH: 65 amino acids 

SPWCHLGAGQQARLGVHRPS S PQCHLGLRL 
1 5 10 15 20 25 30 

CIRLS FYSGAQRHLCQGYHNPSQQEHS I LN 

35 40 45 50 55 60 

S Q P P L 

65 

SEQUENCE ID NO: 255 

SEQUENCE LENGTH: 18 amino acids 
RPPPGS TAPQPMVS PRRR 
15 10 15 

SEQUENCE ID NO: 256 

SEQUENCE LENGTH: 19 amino acids 
RPPPGS TAPPQPMVSPRRR 
15 10 15 

SEQUENCE ID NO: 257 

SEQUENCE LENGTH: 18 amino acids 
RPPPGSTAPPSPWCHLGA 
15 10 15 

SEQUENCE ID NO: 258 

SEQUENCE LENGTH: 17 amino acids 
RP PPGSTAPSPWCHLGA 
15 10 15 

SEQUENCE ID NO: 259 

SEQUENCE LENGTH: 14 amino acids 
RPRAPPPPSPWCHL 
15 10 
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SEQUENCE ID NO: 260 

SEQUENCE LENGTH: 13 amino acids 

RPRAPPPPPSPWC 

1 5 10 

SEQUENCE ID NO: 261 

SEQUENCE LENGTH: 16 amino acids 
RPRAPPPPAHGVTSAP 
15 10 15 

SEQUENCE ID NO: 262 

SEQUENCE LENGTH: 13 amino acids 
RPRAPPPPPAHGV 
15 10 

SEQUENCE ID NO: 263 

SEQUENCE LENGTH: 14 amino acids 
APGLHRPPQPMVSP 
15 10 

SEQUENCE ID NO: 264 

SEQUENCE LENGTH: 15 amino acids 
AAPGLHRPQPMVSPR 
15 10 15 

SEQUENCE ID NO: 265 

SEQUENCE LENGTH: 13 amino acids 
PGLHRPPPAHGVT 
15 10 

SEQUENCE ID NO: 266 

SEQUENCE LENGTH: 14 amino acids 

APGLHRPPAHGVTS 

15 10 
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SEQUENCE ID NO: 267 

SEQUENCE LENGTH: 21 amino acids 

VDRPQHTEWLSWSNLYRIRHQ 

15 10 15 20 



SEQUENCE ID NO: 268 

SEQUENCE LENGTH: 10 amino acids 

HYLCTDVAPR 

15 10 



SEQUENCE ID NO: 269 

SEQUENCE LENGTH: 11 amino acids 

HYLCTDVAPPR 

15 10 



SEQUENCE ID NO: 270 

SEQUENCE LENGTH: 31 amino acids 

HYLCTDVAPPVDRPQHTEWLSWSNLYRIRH 
1 5 10 15 20 25 30 

Q 

SEQUENCE ID NO: 271 

SEQUENCE LENGTH: 30 amino acids 

HYLCTDVAPVDRPQHTEWLSWSNLYRIRHQ 
1 5 10 15 20 25 30 



SEQUENCE ID NO: 272 

SEQUENCE LENGTH: 108 amino acids 

SAYLSPLGTTWLRTCACRLPRPAASCLCTT 
1 5 10 15 20 25 30 

PSLLWPRRTCPAGSPRATSSPWRMPAPKSC 

35 40 45 50 55 60 

CTTGLAFTS P I GLGWRSATASGYARIWPVL 

65 70 75 80 85 90 

SLTCQSWSTSLPSTAVTW 

95 100 105 
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SEQUENCE ID NO: 273 

SEQUENCE LENGTH: 109 amino acids 

PSAYLSPLGTTWLRTCACRLPRPAASCLCT 
1 5 10 15 20 25 30 

TPSLLWPRRTCPAGSPRATSS PWRMPAPKS 

35 40 45 50 55 60 

CCTTGLAFTSPIGLGWRSATASGYARI WPV 

65 70 75 80 85 90 

LSLTCQSWSTSLPSTAVTW 

95 100 105 



SEQUENCE ID NO: 274 

SEQUENCE LENGTH: 12 amino acids 
PAPI FLLWGPLG 
15 10 



SEQUENCE ID NO: 275 

SEQUENCE LENGTH: 11 amino acids 
API FLLWGPLG 
15 10 



SEQUENCE ID NO: 276 

SEQUENCE LENGTH: 117 amino acids 

LPARAPGPPSAYLSPLGTTWLRTCACRLPR 
1 5 10 15 20 25 30 

P£ASCLCTTPSLLWPRRTCPAGSPRATSSP 

35 40 45 50 55 60 

WRMPAPKS CCT TGLAFTS P I GLGWRSATAS 

65 70 75 80 85 90 

GYARIWPVLSLTCQSWSTSLPSTAVTW 

95 100 105 110 115 
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SEQUENCE ID NO: 277 

SEQUENCE LENGTH: 118 amino acids 

LPARAPGPPPSAYLSPLGT TWLRTCACRLP 
1 5 10 15 20 25 30 

RPAASCLCTTPSLLWPRRTCPAGSPRATSS 

35 40 45 50 55 60 

PWRMPAPKSCCTTGLAFTS P IGLGWRSATA 

65 70 75 80 85 90 

SGYARIWPVLSLTCQSWSTSLPSTAVTW 

95 100 105 110 115 



SEQUENCE ID NO: 278 
SEQUENCE LENGTH: 21 amino acids 
LPARAPGPPPAPI FLLWGPLG 
15 10 15 20 



SEQUENCE ID NO: 279 

SEQUENCE LENGTH: 20 amino acids 
LPARAP GP PAP I FLLWGPLG 
15 10 15 20 

SEQUENCE ID NO: 280 

SEQUENCE LENGTH: 14 amino acids 
DLEHHGGVTRHRHR 
1 5 10 



SEQUENCE ID NO: 281 

SEQUENCE LENGTH: 11 amino acids 

LVSDYSMTPRP 

15 10 



SEQUENCE ID NO: 282 

SEQUENCE LENGTH: 12 amino acids 

LVSDYSMTPPRP 

15 10 
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SEQUENCE ID NO: 283 

SEQUENCE LENGTH : 24 amino acids 

lvsdysmtppdlehhggvtrhr.hr 

15 10 15 20 



SEQUENCE ID NO: 284 

SEQUENCE LENGTH: 23 amino acids 
LVSDYSMTPDLEHHGGVTRHRHR 
15 10 15 20 



SEQUENCE ID NO: 285 

SEQUENCE LENGTH: 51 amino acids 

FHHIATDVGPFVRIGFLKIKGKIKGKSLRK 
1 5 10 15 20 25 30 

PNWKTQHKLKRALMFL IVKKL 
35 40 45 50 



SEQUENCE ID NO: 286 

SEQUENCE LENGTH: 52 amino acids 

seq id no 28 6; 

PFHHIATDVGPFVRIGFLKIKGKIKGKSLR 
1 5 10 15 20 25 30 

KPNWKTQHKLKRALMFLIVKKL 
35 40 45 50 



SEQUENCE ID NO: 287 

SEQUENCE LENGTH: 12 amino acids 
PSITLQQMLAPS 
15 10 



SEQUENCE ID NO: 288 

SEQUENCE LENGTH: 11 amino acids 
S ITLQQMLAPS 
15 10 
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SEQUENCE ID NO: 289 

SEQUENCE LENGTH: 60 amino acids 

TSCNEMNPPFHHIATDVGPFVRIGFLKIKG 
1 5 10 15 20 25 30 

KIKGKSLRKPNWKTQHKLKRALMFLIVKKL 
35 40 45 50 55 60 

SEQUENCE ID NO: 290 

SEQUENCE LENGTH: 61 amino acids 

TSCNEMNPPPFHHIATDVGPFVRIGFLKIK 
1 5 10 15 20 25 30 

GKIKGKSLRKPNWKTQHKLKRALMFL IVKK 
35 40 45 50 55 60 

L 



SEQUENCE ID NO: 291 

SEQUENCE LENGTH: 20 amino acids 
TSCNEMNPPS I TLQQMLAPS 
1 5 10 15 20 



SEQUENCE ID NO: 292 

SEQUENCE LENGTH: 21 amino acids 
TSCNEMNPPPSITLQQMLAPS 
15 10 15 20 



SEQUENCE ID NO: 293 

SEQUENCE LENGTH: 10 amino acids 
LEM I LFLM-T F 
15 10 

SEQUENCE ID NO: 294 

SEQUENCE LENGTH: 18 amino acids 
HPCITKTFLEMILFLMTF 
15 10 15 
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SEQUENCE ID NO: 295 
SEQUENCE LENGTH: 19 amino acids 
HPCI TKTFFLEMILFLMTF 
15 10 15 

SEQUENCE ID NO: 296 

SEQUENCE LENGTH: 11 amino acids 

HPCITKTFFWR 

15 10 

SEQUENCE ID NO: 297 

SEQUENCE LENGTH: 10 amino acids 

HPCITKTFWR 

15 10 

SEQUENCE ID NO: 298 
SEQUENCE LENGTH: 22 amino acids 
LMFEHSQMRLNSKNAHLPI I S F 
15 10 15 20 

SEQUENCE ID NO: 299 

SEQUENCE LENGTH: 30 amino acids 

EYGS I IAFLMFEHSQMRLNSKNAHLPIISF 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 300 

SEQUENCE LENGTH: 31 amino acids 

EYGS I IAFFLMFEHS QMRLNS KNAHLP I I S 
1 5 10 15 20 25 30 

F 

SEQUENCE ID NO: 301 

SEQUENCE LENGTH: 15 amino acids 
HLNKGRRLGDKIRAT 
15 10 15 
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SEQUENCE ID NO: 302 

SEQUENCE LENGTH: 16 amino acids 
FHLNKGRRLGDKIRAT 
15 10 15 

SEQUENCE ID NO: 303 

SEQUENCE LENGTH: 23 amino acids 

VTSGTPFFHLNKGRRLGDKIRAT 
15 10 15 20 

SEQUENCE ID NO: 304 

SEQUENCE LENGTH: 24 amino acids 

VTSGTPFFFHLNKGRRLGDKIRAT 
15 10 15 20 

SEQUENCE ID NO: 305 

SEQUENCE LENGTH: 10 amino acids 

VTSGTPFFFI 

15 10 

SEQUENCE ID NO: 306 

SEQUENCE LENGTH: 9 amino acids 

VTSGTPFFI 

1 5 

SEQUENCE ID NO: 307 

SEQUENCE LENGTH: 10 amino acids J 

CEIERIKFFF 

1 5 10 

SEQUENCE ID NO: 308 

SEQUENCE LENGTH: 11 amino acids 
CEIERIHFFSK 
15 10 
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SEQUENCE ID NO: 309 

SEQUENCE LENGTH: 10 amino acids 

CEIERIHFSK 

15 10 



SEQUENCE ID NO: 310 

SEQUENCE LENGTH: 8 amino acids 

FRYISKSI 

1 5 



SEQUENCE ID NO: 311 

SEQUENCE LENGTH: 7 amino acids 
R Y I S K S I 
1 5 



SEQUENCE ID NO: 312 

SEQUENCE LENGTH: 16 amino acids 

FKKYEPIFFRYISKSI 

15 10 15 



SEQUENCE ID NO: 313 

SEQUENCE LENGTH: 15 amino acids 
FKKYEPIFRYISKSI 
15 10 15 



SEQUENCE ID NO: 314 

SEQUENCE LENGTH: 56 amino acids 

FPDSDQPGPLYPLDPSCLISSASNPQELSD 
1 5 10 15 20 25 30 

CHYIHLAFGFSNWRSCPVLPGHCGVQ 
35 40 45 50 55 
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SEQUENCE ID NO: 315 

SEQUENCE LENGTH: 55 amino acids 

PDSDQPGPLYPLDPSCLISSASNPQELSDC 
1 5 10 15 20 25 30 

HYI HLAFGFSNWRSCPVLPGHCGVQ 
35 40 45 50 55 



SEQUENCE ID NO: 316 

SEQUENCE LENGTH: 9 amino acids 

LNMFASVFS 

1 5 

SEQUENCE ID NO: 317 

SEQUENCE LENGTH: 10 amino acids 

LNMFASVFFS 

1 5 10 15 

SEQUENCE ID NO: 318 
SEQUENCE LENGTH: 64 amino acids 
LNMFASVFFPDSDQPG 
15 10 15 

SNPQELSDCHYIHLAF 
35 40 45 

C G V Q 



PLYPLDPSCLISSA 
20 25 30 

GFSNWRSCPVLPGH 
50 55 60 



SEQUENCE ID NO: 319 

SEQUENCE LENGTH: 63 amino acids 
L N MFASVFPDSDQPGP 
15 10 15 

NPQELSDCHY IHLAFG 
35 40 45 

G V Q 



LYPLDPSCLISSAS 
20 25 30 

FSNWRSCPVLPGHC 
50 55 60 
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SEQUENCE ID NO: 320 

SEQUENCE LENGTH: 63 amino acids 

AMEETVVVAVATVETEVEAMEETGVVAAME 
1 5 10 15 20 25 30 

ETEVGATEETEVAMEAKWEEETTTEMI SAT 
35 40 45 50 55 60 

D H T 



SEQUENCE ID NO: 321 

SEQUENCE LENGTH: 55 amino acids 

LWVRPWLWEWLRWRPKWRLWRRQEWWRLWR 
1 5 10 15 20 25 30 

RPRWGLRRRPRWLWRENGRKKRLQK 
35 40 45 50 55 

SEQUENCE ID NO: 322 

SEQUENCE LENGTH: 71 amino acids 

YGGDRSRGAMEETVVVAVATVETEVEAMEE 
1 5 10 15 20 25 30 

TGVVAAMEETEVGATEETEVAMEAKWEEET 

35 40 45 50 55 60 

TTEMISATDHT 

65 70 

SEQUENCE ID NO: 323 

SEQUENCE LENGTH: 72 amino acids 

YGGDRSRC AMEETVVVAVATVETEVEAME 
1 5 10 15 20 25 30 

ETGVVAAMEE TEVGATEETEVAMEAKWEEE 

35 40 45 50 55 60 

TTTEMI SATDHT 

65 70 
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SEQUENCE ID NO: 324 

SEQUENCE LENGTH: 64 amino acids 

YGGDRSRGGLWVRPWLWEW. LRWEPKWRLWR 
1 5 10 15 20 25 30 

RQEWWRLWRRPRWGLRRRPRWLWRE N G R K K 
35 40 45 50 55 60 

R L Q K 



SEQUENCE ID NO: 325 

SEQUENCE LENGTH: 63 amino acids 

YGGDRS RGLWVRPWLWEWLRWEPKWRLWRR 
1 5 10 15 20 25 30 

QEWWRLWRRPRWGLRRRPRWLWRENGRKKR 
35 40 45 50 55 60 

L Q K 



SEQUENCE ID NO: 326 

SEQUENCE LENGTH: 9 amino acids 

EFGGGRRQK 

1 5 



SEQUENCE ID NO: 327 

SEQUENCE LENGTH: 8 amino acids 

EFGGRRQK 

1 5 



SEQUENCE ID NO: 328 

SEQUENCE LENGTH: 15 amino acids 
RRAKGGGAGASNPRQ 
15 10 15 



SEQUENCE ID NO: 329 

SEQUENCE LENGTH: 16 amino acids 

GRRAKGGGAGASNPRQ 

15 10 15 
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SEQUENCE ID NO: 330 

SEQUENCE LENGTH: 21 amino acids 
DVGLREGALELPTRGNKRNVA 
15 10 15 20 

SEQUENCE ID NO: 331 

SEQUENCE LENGTH: 24 amino acids 

MRGGGGVGGRRAKGGGAGASNPRQ 
15 10 15 20 

SEQUENCE ID NO: 332 

SEQUENCE LENGTH: 25 amino acids 

MRGGGGVGGGRRAKGGGAGASNPRQ 
1 5 10 15 20 25 

SEQUENCE ID NO: 333 

SEQUENCE LENGTH: 30 amino acids 

MRGGGGVGGDVGLREGALELPTRGNKRNVA 
1 5 10 15 20 25 30 

SEQUENCE ID NO: 334 

SEQUENCE LENGTH: 29 amino acids 

MRGGGGVGDVGLREGALELPTRGNKRNVA 
1 5 10 15 20 25 

SEQUENCE ID NO: 335 

SEQUENCE LENGTH: 25 amino acids 

VWQLAGPML^lGWRS LGSWFCRMYGI 
1 5 10 15 20 25 

SEQUENCE ID NO: 336 

SEQUENCE LENGTH: 4 6 amino acids 

CGSWPALCWRAGGVWAVGSAGCMEYDPEAL 
1 5 10 15 20 25 30 

PAAWGPAAAATVHPRR 
35 40 45 
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SEQUENCE ID NO: 337 

SEQUENCE LENGTH: 33 amino acids 

RRYPCEWGVWQLAGPMLAGWRSLGSWFCRM 
1 5 10 15 20 25 30 

Y G I 



SEQUENCE ID NO: 338 

SEQUENCE LENGTH: 34 amino acids 

RRYPCEWGGVWQLAGPMLAGWRSLGSWFCR 
1 5 10 15 20 25 30 

M Y G I 

SEQUENCE ID NO: 339 

SEQUENCE LENGTH: 55 amino acids 

RRYPCEWGGCGSWPALCWRAGGVWAVGSA. G 
1 5 10 15 20 25 30 

CM E Y D P EAL PAAWGPAAAATVH P RR 
35 40 45 50 55 

SEQUENCE ID NO: 340 

SEQUENCE LENGTH: 54 amino acids 

RRYPC'EWGCGSW PALCWRAGGVWAVGSAGC 
1 5 10 15 20 25 30 

MEYDPEALPAAWGPAAAATVHPRR 
35 40 45 50 

SEQUENCE ID NO: 341 «s 
SEQUENCE LENGTH: 43 amino acids 

LWLWAGWTVWWSCGPGEKGHGWPS LPTMAL 
1 5 10 15 20 25 30 

LLLRFSCMRVASY 
35 40 
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SEQUENCE ID NO: 342 

SEQUENCE LENGTH: 4 4 amino acids 
GLWLWAGWTVWWSCGPG 
15 10 15 

LLLLRFSCMRVASY 
35 40 

SEQUENCE ID NO: 343 

SEQUENCE LENGTH: 84 amino acids 
GCGCGPAGQYGGAVGLARRGTAGCLPCPPW 

1 5 10 15 20 25 30 

LCCCCAFPACGLPGTDGWRGWQGSGCVRVS 

35 40 45 50 55 60 

GSAPWAPGFPFSPPCPLCGTQPRW 

65 70 75 80 

SEQUENCE ID NO: 344 

SEQUENCE LENGTH: 83 amino acids 

CGCGPAGQYGGAVGLARRGTAGCLPCPPWL 
1 5 10 15 20 25 30 

CCCCAFPACGL PGTDGWRGWQGSGCVRVSG 

35 40 45 50 55 60 

SAPWAPGFPFS PPCPLCGTQPRW 

65 70 75 80 

SEQUENCE ID NO: 345 

SEQUENCE LENGTH: 51 amino acids 

LAFNVPGGLWL'-AGWTVWWSCGPGEKGHGW 
1 5 10 15 20 25 30 

PSLPTMALLLLRFSCMRVASY 
35 40 45 50 



EKGHGWPSLPTMA 
20 25 30 
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SEQUENCE ID NO: 34 6 

SEQUENCE LENGTH: 52 amino acids 

LAFNVPGGGL W L WAGWTVWWSCGPGEKGHG 
1 5 10 15 20 25 30 

WPSLPTMALLLLRFSCMRVASY 
35 40 45 50 

SEQUENCE ID NO: 347 

SEQUENCE LENGTH: 92 amino acids 

LAFNVPGGGCGCGPAGQYGGAVGLARRGTA 
1 5 10 15 20 25 30 

GCLPCPPWLCCCCAFPACGLPGTDGWRGWQ 

35 40 45 50 55 60 

GSGCVRVSGSAPWAPGFPFSPPCPLCGTQP 

65 70 75 80 85 90 

R W 



SEQUENCE ID NO: 348 

SEQUENCE LENGTH: 91 amino acids 

LAFNVPGGCGCGPAGQYGGAVGLARRGTAG 
1 5 10 15 20 25 30 

CLPCPPWLCCCCAFPACGLPGTDGWRGWQG 

35 40 45 50 55 60 

SGCVRVSGSAPWAPGFPFSPPCPLCGTQPR 

65 70 75 80 85 90 

W 



SEQUENCE ID NO: 34 9 

SEQUENCE LENGTH: 17 amino acids 
PPMPMPGQREAPGRQEA 
15 10 15 



SEQUENCE ID NO: 350 

SEQUENCE LENGTH: 18 amino acids 
GPPMPMPGQREAPGRQEA 
15 10 15 
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SEQUENCE ID NO: 351 

SEQUENCE LENGTH: 24 amino acids 

GHQCQCQGKGRHRADRRPDTAQEE 
15 10 15 20 

SEQUENCE ID NO: 352 

SEQUENCE LENGTH: 23 amino acids 

HQCQCQGKGRHRADRRPDTAQEE 
15 10 15 20 

SEQUENCE ID NO: 353 

SEQUENCE LENGTH: 25 amino acids 

GGHSYGGGPPMPMPGQREAPGRQEA 
15 10 15 20 25 

SEQUENCE ID NO: 354 

SEQUENCE LENGTH: 26 amino acids 

GGHSYGGGGPPMPMPGQREAPGRQEA 
1 5 10 15 20 25 

SEQUENCE ID NO: 355 

SEQUENCE LENGTH: 32 amino acids 

GGHSYGGGGHQCQCQGKGRHRADRRPDTAQ 
1 5 10 15 20 25 30 

E E 

SEQUENCE ID NO: 356 

SEQUENCE LENGTH: 31 amino acids 

GGHSYGGGHQCQCQGKGRHRADRRPDTAQE 
1 5 10 15 20 25 30 

E 

SEQUENCE ID NO: 357 

SEQUENCE LENGTH: 10 amino acids 

APCPQSSGGG 

15 10 
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SEQUENCE ID NO: 358 

SEQUENCE LENGTH: 17 amino acids 
LPAPSQAAADELDRRPG 
15 10 15 

SEQUENCE ID NO: 359 

SEQUENCE LENGTH: 18 amino acids 
TKVRLIRGAPCPQSSGGG 
15 10 15 

SEQUENCE ID NO: 360 

SEQUENCE LENGTH: xx amino acids 

TKVRLIRGGAPCPQSSGGG 

15 10 

SEQUENCE ID NO: 361 

SEQUENCE LENGTH: 26 amino acids 

TKVRLIRGGLPAPSQAAADELDRRPG 
1 5 10 15 20 25 

SEQUENCE ID NO: 362 

SEQUENCE LENGTH: 25 amino acids 

TKVRL IRGLPAPSQAAADELDRRPG 
15 10 15 20 25 

SEQUENCE ID NO: 363 

SEQUENCE LENGTH: 45 amino acids 

CSLAKDGSTEDTVSSLCGEEDTEDEELEAA 
1 5 10 15 20 25 30 

ASHLNKDLYRELLGG 
35 40 45 
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SEQUENCE ID NO: 364 

SEQUENCE LENGTH: 46 amino acids 

GCSLAKDGSTEDTVSSLCGEEDTEDEELEA 
1 5 10 15 20 25 30 

AASHLNKDLYRELLGG 
35 40 45 



SEQUENCE ID NO: 365 

SEQUENCE LENGTH: 21 amino acids 
AAAWQKMAPPRT PRPACVARR 
15 10 15 20 



SEQUENCE ID NO: 366 

SEQUENCE LENGTH: 54 amino acids 

ENSRPKRGGCSLAKDGSTEDTVSSLCGEED 
15 10 15 20 25 3 

TEDEELEAAASHLNKDLYRELLGG 
35 40 45 50 



SEQUENCE ID NO: 367 

SEQUENCE LENGTH: 55 amino acids 

ENSRPKRGGGCSLAKDGSTEDTVSSLCGEE 
1 5 10 15 20 25 3 

DTEDEELEAAASHLNKDLYRELLGG 

35 40 45 50 55 



SEQUENCE ID NO: 368 

SEQUENCE LENGTH: 30 amino du-ds 

ENSRPKRGGAAAWQKMAPPRTPRPACVARR 
1 5 10 15 20 25 30 



SEQUENCE ID NO: 369 

SEQUENCE LENGTH: 29 amino acids 

ENSRPKRGAAAWQKMAP PRTPRPACVARR 
15 10 15 20 25 
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SEQUENCE ID NO: 370 

SEQUENCE LENGTH: 10 amino acids 

HCVLAASGAS 

15 10 

SEQUENCE ID NO: 371 

SEQUENCE LENGTH: 11 amino acids 
GHCVLAASGAS 
15 10 
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SEQUENCE ID NO: 372 

SEQUENCE LENGTH: 28 amino acids 

GTASSRPLGLPKPHLHRPVP IRHPSCPK 
15 10 15 20 25 

SEQUENCE ID NO: 373 

SEQUENCE LENGTH: 27 amino acids 

TASSRPLGLPKPHLHR P V PIRHPSCPK 
15 10 15 20 25 

SEQUENCE ID NO: 374 

SEQUENCE LENGTH: 18 amino acids 
AGTLQLGGHCVLAASGAS 
15 10 15 

SEQUENCE ID NO: 375 

SEQUENCE LENGTH: 19 amino acids 
AGTLQLGGGHCV L? ; A A S G A S 
15 10 15 

SEQUENCE ID NO: 376 

SEQUENCE LENGTH: 35 amino acids 

AGTLQLGGTASSRPLGLPKPHLHRPVPIRH 
1 5 10 15 20 25 30 

P S C P K 
35 
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SEQUENCE ID NO: 377 

SEQUENCE LENGTH: 36 amino acids 

AGTLQLGGGTASSRPLGLPKPHLHRPVPIR 
1 5 10 15 20 25 30 

H P S C P K 
35 

SEQUENCE ID NO: 378 

SEQUENCE LENGTH: 9 amino acids 

RRTPSTEKR 

1 5 



SEQUENCE ID NO: 379 

SEQUENCE LENGTH: 10 amino acids 

RRTPSTEKKR 

15 10 



SEQUENCE ID NO: 380 

SEQUENCE LENGTH: 14 amino acids 
RRTPSTEKKGRSEC 
15 10 



SEQUENCE ID NO: 381 

SEQUENCE LENGTH: 13 amino acids 
RRTPSTEKGRSEC 
15 10 



SEQUENCE ID NO: 382 

SEQUENCE LENGTH: 4 6 amino acids 

STTKCQSGTAETYNSWKVKNLQLEPRRVTS 
1 5 10 15 20 25 30 

QMNRQVKDMTAI LSQS 
35 40 45 
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SEQUENCE ID NO: 383 

SEQUENCE LENGTH: 17 amino acids 
VQPNASQAQQKPTTHGR 
15 10 15 

SEQUENCE ID NO: 384 

SEQUENCE LENGTH: 54 amino acids 

SSEEIKKKSTTKCQSGTAETYNSWKVKNLQ 
1 5 10 15 20 25 30 

LEPRRVTSQMNRQVKDMTAI LS QS 
35 40 45 50 

SEQUENCE ID NO: 385 

SEQUENCE LENGTH: 55 amino acids 

SSEEIKKKKSTTKCQSGTAETYNSWKVKN'L 
1 5 10 15 20 25 30 

QLEPRRVTSQMNRQVKDMTAI LSQS 
35 40 45 50 55 

SEQUENCE ID NO: 386 

SEQUENCE LENGTH: 26 amino acids 

SSEEIKKKKVQPNASQAQQKPTTHGR 
15 10 15 20 25 

SEQUENCE ID NO: 387 

SEQUENCE LENGTH: xx amino acids 

SSEEIKKKVQPNASQAQQKPTTHGR 
1 5 10 15 20 25 

SEQUENCE ID NO: 388 

SEQUENCE LENGTH: 9 amino acids 

NRGWVGAGE 

1 5 
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SEQUENCE ID NO: 389 

SEQUENCE LENGTH: 4 amino acids 

I E A G 

1 

SEQUENCE ID NO: 390 

SEQUENCE LENGTH: 17 amino acids 
VHNYCNMKNRGWVGAGE 
15 10 15 

SEQUENCE ID NO: 391 

SEQUENCE LENGTH: 18 amino acids 
VHNYCNMKKNRGWVGAGE 
15 10 15 

SEQUENCE ID NO: 392 

SEQUENCE LENGTH: 13 amino acids 
VHNYCNMKKI EAG 
15 10 

SEQUENCE ID NO: 393 

SEQUENCE LENGTH: 12 amino acids 
VHNYCNMKIEAG 
15 10 

SEQUENCE ID NO: 394 

SEQUENCE LENGTH: 25 amino acids 

QLRCWNTWAKMFFMVFui IWQNTMF 
1 5 10 15 20 25 

SEQUENCE ID NO: 395 

SEQUENCE LENGTH: 33 amino acids 

VKKDNHKKQLRCWNTWAKMFFMVFLI IWQN 
1 5 10 15 20 25 30 

T M F 
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SEQUENCE ID NO: 396 

SEQUENCE LENGTH: 34 amino acids 

VKKDNHKKKQLRCWNTWAKMFFMVFLII WQ 
1 5 10 15 20 25 30 

N T M F 

SEQUENCE ID NO: 397 

SEQUENCE LENGTH: 11 amino acids 

VKKDNHKKKNS 

15 10 

SEQUENCE ID NO: 398 

SEQUENCE LENGTH: 10 amino acids 

VKKDNHKKNS 

1 5 10 

SEQUENCE ID NO: 399 

SEQUENCE LENGTH: 35 amino acids 

GAEESGPFNRQVQLKVHASGMGRHLWNCPA 
1 5 10 15 20 25 30 

F W S E V 
35 

SEQUENCE ID NO: 400 

SEQUENCE LENGTH: 10 amino acids 
H P S P P P E K R S y *• 
15 10 

SEQUENCE ID NO: 401 

SEQUENCE LENGTH: 11 amino acids 
HPSPPPEKKRS 
15 10 
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SEQUENCE ID NO: 402 

SEQUENCE LENGTH: 44 amino acids 

HPSPPPEKKGAEESGPFNRQVQLKVHASGM 
1 5 10 15 20 25 30 

GRHLWNCPAFWSEV 
35 40 

SEQUENCE ID NO: 403 

SEQUENCE LENGTH: 43 amino acids 

H P S P P PEKGAEESGP FNRQVQLKVHAS GMG 
1 5 10 15 20 25 30 

RHLWNCPAFWSEV 
35 40 



SEQUENCE ID NO: 404 

SEQUENCE LENGTH: 39 amino acids 

MQVLS KTHMNLFPQVLLQMFLRGLKRLLQD 
1 5 10 15 20 25 30 

LEKSKKRKL 
35 



SEQUENCE ID NO: 405 

SEQUENCE LENGTH: 8 amino acids 

RCKSARLI 

1 5 



SEQUENCE ID NO: 406 

SEQUENCE LENGTH: 48 amino acids 

VQTQPAIKKMQVLSKTHMNLFPQVLLQMFL 
1 5 10 15 20 25 30 

RGLKRLLQDLEKSKKRKL 
35 40 45 
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SEQUENCE ID NO: 407 

SEQUENCE LENGTH: 4 9 amino acids 

VQTQPAIKKKMQVLSKTHMNLFPQVLLQMF 
1 5 10 15 20 25 30 

LRGLKRLLQDLEKSKKRKL 
35 40 45 

SEQUENCE ID NO: 408 

SEQUENCE LENGTH: 17 amino acids 

VQTQPAIKKRCKSARLI 

1 5 10 15 

SEQUENCE ID NO: 409 

SEQUENCE LENGTH: 16 amino acids 

VQTQPAIKRCKSARLI 

15 10 15 

SEQUENCE ID NO: 410 

SEQUENCE LENGTH: 11 amino acids 

ARSGKKQKRKL 

15 10 

SEQUENCE ID NO: 411 

SEQUENCE LENGTH: 12 amino acids 
ARSGKKQKKRKL 
15 10 

SEQUENCE ID NO: 412 

SEQUENCE LENGTH: 13 amino acids 
ARSGKKQKKENSF 
15 10 
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SEQUENCE ID NO: 413 

SEQUENCE LENGTH: 12 amino acids 
ARSGKKQKENSF 
15 10 



SEQUENCE ID NO: 414 

SEQUENCE LENGTH: 14 amino acids 
KASARSGKSKKRKL 
15 10 



SEQUENCE ID NO: 415 

SEQUENCE LENGTH: 15 amino acids 
KASARSGKKSKKRKL 
15 10 15 



SEQUENCE ID NO: 416 

SEQUENCE LENGTH: 16 amino acids 
KASARSGKKAKKENSF 
15 10 15 



SEQUENCE ID NO: 417 

SEQUENCE LENGTH: 15 amino acids 
KASARSGKAKKENS F 
15 10 15 



SEQUENCE ID NO: 418 

SEQUENCE LENGTH: 15 amino acids 
HLNKGRRLGDKIRAT 
15 10 15 
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SEQUENCE ID NO: 419 

SEQUENCE LENGTH : 23 amino acids 

VTSGTPFFHLNKGRRLGDKIRAT 
15 10 15 20 

SEQUENCE ID NO: 420 

SEQUENCE LENGTH: 24 amino acids 

VTSGTPFFFHLNKGRRLGDKIRAT 
15 10 15 20 

SEQUENCE ID NO: 421 

SEQUENCE LENGTH: 10 amino acids 

VTSGTPFFFI 

15 10 

SEQUENCE ID NO: 422 

SEQUENCE LENGTH: 9 amino acids 

VTSGTPFFI 

1 5 

SEQUENCE ID NO: 423 

SEQUENCE LENGTH: 51 amino acids 

VTLLYVNTVTLAPNVNMESSRNAHSPATPS 
1 5 10 15 20 25 30 

AKRKDPDLTWGGFVFFFCQFH 
35 40 45 50 
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SEQUENCE ID NO: 424 

SEQUENCE LENGTH: 60 amino acids 

KCRCKPNFFVTLLYVNTVTLAPNVNMESSR 
1 5 10 15 20 25 30 

NAHSPATPSAKRKDPDLTWGGFVFFFCQFH 
35 40 45 50 65 60 



SEQUENCE ID NO: 425 

SEQUENCE LENGTH: 61 amino acids 

KCRCKPNFFFVTLLYVNTVTLAPNVNMESS 
1 5 10 15 20 25 30 

RNAHSPATPSAKRKDPDLTWGGFVFFFCQF 
35 40 45 50 65 60 

H 



SEQUENCE ID NO: 426 

SEQUENCE LENGTH: 10 amino acids 

KCRCKPNFFL 

15 10 



SEQUENCE ID NO: 427 

SEQUENCE LENGTH: 9 amino acids 

KCRCKPNFL 

1 5 



SEQUENCE ID NO: 428 

SEQUENCE LENGTH: 9 amino acids 

SLVRLSSCV 

1 5 
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SEQUENCE ID NO: 429 
SEQUENCE LENGTH: 14 amino acids 
LVKKLKEKKMNW I L 
1 5 10 

SEQUENCE ID NO: 430 

SEQUENCE LENGTH: 15 amino acids 
LVKKLKEKKKMNWIL 
15 10 15 

SEQUENCE ID NO: 431 

SEQUENCE LENGTH: 10 amino acids 

LVKKLKEKKR 

15 10 

SEQUENCE ID NO: 432 

SEQUENCE LENGTH: 9 amino acids 

LVKKLKEKR 

1 5 

SEQUENCE ID NO: 433 

SEQUENCE LENGTH: 9 amino acids 

AAIVKDCCR 

1 5 

SEQUENCE ID NO: 434 

SEQUENCE LENGTH: 11 amino acids 
SQPAS ILGRKL 
15 10 

SEQUENCE ID NO: 435 

SEQUENCE LENGTH: 12 amino acids 
SQPAS I LGKRKL 
15 10 15 
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SEQUENCE ID NO: 436 

SEQUENCE LENGTH: 18 amino acids 
SQPAS ILGKAAIVKDCCR 
15 10 15 

SEQUENCE ID NO: 437 

SEQUENCE LENGTH: 17 amino acids 
SQPAS I LGAAIVKDCCR 
15 10 15 



SEQUENCE ID NO: 438 

SEQUENCE LENGTH: 18 amino acids 
KSLVRLSSCVPVALMSAM 
15 10 15 

SEQUENCE ID NO: 439 

SEQUENCE LENGTH: 9 amino acids 

RLSSCVPVA 

1 5 

SEQUENCE ID NO: 440 

SEQUENCE LENGTH: 9 amino acids 

VRLSSCVPV 

1 5 

SEQUENCE ID NO: 441 

SEQUENCE LENGTH: 9 amino acids 

LVRLSSCVP 

1 5 

SEQUENCE ID NO: 442 

SEQUENCE LENGTH: 9 amino acids 

SCVPVALMS 

1 5 
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SEQUENCE ID NO: 443 

SEQUENCE LENGTH: 9 amino acids 

SSCVPVALM 

1 5 

SEQUENCE ID NO: 444 

SEQUENCE LENGTH: 9 amino acids 
LS SCVPVAL 
1 5 



SEQUENCE ID NO: 445 

SEQUENCE LENGTH: 9 amino acids 

VPVALMSAM 

1 5 

SEQUENCE ID NO: 44 6 

SEQUENCE LENGTH: 9 amino acids 

CVPVALMSA 

1 5 

SEQUENCE ID NO: 447 

SEQUENCE LENGTH: 9 amino acids 

KKKSLVRLS 

1 5 

SEQUENCE ID NO: 448 

SEQUENCE LENGTH: 9 amino acids 

EKKKSLVRL 

1 5 

SEQUENCE ID NO: 44 9 

SEQUENCE LENGTH: 9 amino acids 

KEKKKSLVR 

1 5 
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SEQUENCE ID NO: 450 
SEQUENCE LENGTH: 9 amino acids 
MKEKKKSLV 
1 5 

SEQUENCE ID NO: 451 

SEQUENCE LENGTH: 9 amino acids 
IMKEKKKSL 
1 5 



SEQUENCE ID NO: 452 

SEQUENCE LENGTH: 9 amino acids 

KCIMKEKKA 

1 5 

SEQUENCE ID NO: 453 

SEQUENCE LENGTH: 9 amino acids 
C IMKEKKAW 
1 5 

SEQUENCE ID NO: 454 

SEQUENCE LENGTH: 9 amino acids 

C IMKEKKKA 

1 5 

SEQUENCE ID NO: 455 

SEQUENCE LENGTH: 9 amino acids 

IMKEKKKAW 

1 5 

SEQUENCE ID NO: 456 

SEQUENCE LENGTH: 13 amino acids 
HPSWPWTRCLRMR 
15 10 
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SEQUENCE ID NO: 457 

SEQUENCE LENGTH: 14 amino acids 

RHPSWPWTRCLRMR 

15 10 

SEQUENCE ID NO: 458 

SEQUENCE LENGTH: 16 amino acids 

GASGCVHQEAERVSQA 

15 10 15 



SEQUENCE ID NO: 459 
SEQUENCE LENGTH: 20 amino acids 
NTWAKMFFMVFLI IWQNTMF 
15 10 15 20 
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